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Abstract. The equilibrium state of a system consisting of a large number of strongly interacting 
electrons can be characterized by its density operator. This gives a direct access to the ground- 
state energy or, at finite temperatures, to the free energy of the system as well as to other static 
physical quantities. Elementary excitations of the system, on the other hand, are described within 
the language of Green's functions, i.e. time- or frequency-dependent t/ynam/c quantities which give 
a direct access to the linear response of the system subjected to a weak time-dependent external 
perturbation. A typical example is angle-revolved photoemission spectroscopy which is linked to 
the single-electron Green's function. Since usually both, the static as well as the dynamic physical 
quantities, cannot be obtained exactly for lattice fermion models like the Hubbard model, one has 
to resort to approximations. Opposed to more ad hoc treatments, variational principles promise to 
provide consistent and controlled approximations. Here, the Ritz principle and a generalized version 
of the Ritz principle at finite temperatures for the static case on the one hand and a dynamical 
variational principle for the single-electron Green's function or the self-energy on the other hand are 
introduced, discussed in detail and compared to each other to show up conceptual similarities and 
differences. In particular, the construction recipe for non-perturbative dynamic approximations is 
taken over from the construction of static mean-field theory based on the generalized Ritz principle. 
Within the two different frameworks, it is shown which types of approximations are accessible, and 
their respective weaknesses and strengths are worked out. Static Hartree-Fock theory as well as 
dynamical mean-field theory are found as the prototypical approximations. 
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1. MOTIVATION 

To understand the physics of systems consisting of a large number of interacting 
fermions constitutes one of the main and most important types of problems in physics. In 
condensed-matter physics many materials properties are governed, for example, by the 
interacting "gas" of valence electrons. From the theoretical perspective, the Coulomb in- 
teraction among the valence electrons must be considered as strong or at least of the same 
order of magnitude as compared to their kinetic energy for transition metals and their 
oxides, for example. This implies that usual weak-coupling perturbation theory [1, 2, 3] 
does not apply. Density-functional theory (DFT) [4, 5, 6, 7, 8] can be regarded as a 
standard technique in the field of electronic- structure calculations for condensed-matter 
systems. It provides an in principle exact approach which yields the electron density and 
the energy of the ground state. In practice, however, it must be combined with approx- 
imations such as the famous local density approximation (LDA). While this DFT-LDA 
scheme has been proven to be extremely successful in predicting ground-state prop- 
erties of a large class of materials, there are also several well-known shortcomings for 
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so-called strongly correlated systems. These comprise many of 3d or 4f transition-metals 
and their oxides, for example. Another defect of the standard DFT consists in its inabil- 
ity to predict excited-state properties and the dynamic linear response. This is crucial, 
however, to make contact to experimental probes such as angle-resolved photoemission, 
for example. Interpretations of photoemission spectra are often based on the DFT-LDA 
band structure. This lacks a fundamental justification and is is essentially equivalent to a 
Hartree-Fock-like picture of essentially independent electrons. The Hartree-Fock theory 
can be derived from a "static" variational principle where the ground-state energy or, at 
finite temperatures, the grand potential is minimized when expressed in a proper way 
as a functional of the pure or mixed state of the system, respectively. This is the Ritz 
variational principle. 

Opposed to the static variational principle, however, there is a well-known "dynami- 
cal" variational principle which directly focuses on the one-electron excitation spectrum 
[9]. Here the grand potential is expressed as a functional of the one-electron Green's 
function or the self-energy and can be shown to be stationary at the respective physical 
quantity. Similar to the density functional and similar to the Ritz principle, the dynamical 
variational principle is formally exact but needs additional approximations for a practi- 
cal evaluation. Since long the approximations constructed in this way [10, 11] have been 
perturbative as they are defined via partial resummations of diagrams where contribu- 
tions at some finite order are missing. Hence, they are valid in the weak-coupling regime 
only. The question arises whether it is possible to derive approximations from a dynam- 
ical variational principle which are non-perturbative and able to access the physics of 
strongly correlated electron systems where several interesting phenomena, like sponta- 
neous magnetic order [12, 13], correlation-driven metal-insulator transitions [14, 15, 16] 
or high-temperature superconductivity [17, 18] emerge. 

Rather than starting from the non-interacting Fermi gas as the reference point around 
which the perturbative expansion is developed, a local perspective appears to be more 
attractive for strongly correlated electron systems, in particular for prototypical lattice 
models with local interaction, such as the famous Hubbard model [19, 20, 21]. The idea 
is that the local physics of a solid-state ion with a strong and due to screening effects 
essentially local Coulomb interaction is the more proper starting point for a system- 
atic theory and that a self-consistent embedding of the ion in the lattice environment 
captures the main effects. Since the invention of dynamical mean-field theory (DMFT) 
[22, 23, 24, 25, 26], a non-perturbative approximation with many attractive properties 
is available which just relies on this local perspective. The paradigmatic field of appli- 
cations for the DMFT is the Mott-Hubbard metal-insulator transition [27, 16] which, at 
zero temperature, can be seen as prototypical quantum phase transition that is driven 
by the electron-electron interaction and cannot be captured by perturbative methods. 
"Mottness", i.e. physical phenomena originating from a close parametric distance to the 
Mott transition or the Mott insulator, is also believed to be a possible key feature for an 
understanding of the many unusual and highly interesting properties of cuprate-based 
high-temperature superconductors. This example shows that the DMFT, at least as a 
starting point for further methodical improvements, nowadays appears as an attractive 
approach to the electronic structure of unconventional materials. In particular, there is 
the exciting perspective that, when combined with DFT-LDA, dynamical mean-field the- 
ory will ultimately be able to constitute a new standard for ab initio electronic- structure 
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calculations with a high predictive power. 

The DMFT can be derived in an elegant way from the dynamical variational principle. 
The purpose of these lecture notes is to demonstrate how this is achieved and whether it 
is possible to derive similar or new approximations in the same way and to characterize 
the strengths and weak points of these "dynamical" approximations. The strategy to be 
pursued here is to first understand the formalism related to the static Ritz principle and 
to show up the differences but also the close analogies with the dynamic approach. 

The notes are organized as follows: The next section introduces the systems we are 
interested in and discusses on a general level the variational approach as such. Sec. 3 
then develops the static variational principle as a generalization of the Ritz principle. 
This is used in Sec. 4 to construct static mean-field theory. To transfer the insight that 
has been gained from the static approach to the dynamic one, Sec. 5 introduces the con- 
cept of Green's functions and diagrammatic perturbation theory. With this it becomes 
possible to define the central Luttinger-Ward functional and the self-energy functional 
which serve to set up the dynamical variational principle. These points are discussed 
in Sec. 6. With the variational cluster approximation we give a standard example for 
a non-perturbative approximation constructed from the dynamical variational principle. 
Consistency issues, symmetry breaking and the systematics of dynamical approxima- 
tions are discussed in Sec. 7. Sec. 8 particularly focuses on approximations related to 
dynamical mean-field theory. A summary and the conclusions are given in Sec. 9. 

Sees. 2-5 are written on a standard textbook level and can be understood with basic 
knowledge in many-body theory. The contents of Sees. 6 - 8 is basically taken from Ref. 
[28] but include some extensions and changes necessary for a self-contained presentation 
and to make the topic more accessible to the less experienced reader. 

2. MODELS AND VARIATIONAL METHODS 

We consider a system of electrons in thermodynamical equilibrium at temperature T 
and chemical potential /i. The Hamiltonian of the system H = H{t,U) = Ho{t) + 
H\{U) consists of a non-interacting part specified by one-particle parameters t and an 
interaction part with interaction parameters U : 

Hl{U) = ^ L UapSrcWpCyCS- (1) 

The index a refers to an arbitrary set of quantum numbers labeling an orthonormal basis 
of one-particle states As is apparent from the form of H, the total particle number 
N — Y.a^a with Ha = c^Ca is conserved, t and U refer to the set of hopping matrix 
elements and interaction parameters and are formally given by: 
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f«^^r = "'(a|<^'(/S|^i^lr)"'l«)'^' (2) 

where /2m is the electron's kinetic energy, V{r) the external potential and 
const. /|r'-^) — r'^^^l the electrostatic Coulomb interaction between two electrons "1" 
and "2". One has to be aware, however, that in many contexts t and U are merely seen 
as model parameters or considered as effective parameters which in addition account 
for effects not included explicitly in the Hamiltonian, such as metallic screening, for 
example. 

The Hamiltonian describes the most general two-particle interaction. To give exam- 
ples and to apply the techniques to be discussed below to a more concrete situation, 
it is sometimes helpful to focus on a less general model. The famous Hubbard model 
[19, 20,21], 

^ = L ^if^io'^ j^^^H niaUi-c , (3) 
ijo io 

is a prototypical model for a system of strongly correlated electrons. Here, electrons 
are assumed to hop over the sites of an infinitely extended lattice with a single spin- 
degenerate atomic orbital per lattice site: a = (io). The hopping integrals are assumed to 
be diagonal with respect to the spin index and to be spin-independent. Furthermore, the 
interaction is assumed to be strongly screened and to act only locally, i.e. two electrons 
must occupy the same lattice site i to interact via the Hubbard-i7. Due to the Pauli 
principle, these electrons must then have opposite spin projections a =t, i- 

There are numerous and largely different many-body techniques for an approximate 
solution of the Hubbard model or for the more general model Eq. (1). Here, we will 
concentrate on ground-state properties or properties of the system in thermal equilibrium 
and focus on two classes of approaches, namely techniques based on a 

• "static" variational principle 5t2[p] = 
as well as techniques based on a 

• "dynamic" variational principle 5n[Z] = 

which represent prototypical examples of different variants of variational principles. 
These two classes of principles are different, and actually there is no (known) mapping 
between them. On the other hand, there are a number of illuminating and apparent 
analogies which are worth to be discussed. Formally, the principles are exact. The static 
principle provides the exact state of the quantum system or, at finite temperature, the 
exact density matrix p of the system in thermal equilibrium. The dynamical principle, 
on the other hand, yields the exact equilibrium self-energy E or Green's function of the 
system. For all practical issues, it is clear, however, that approximations are necessary. 

There are some obvious advantages of approximations constructed from a variational 
principle of the form 5Q.{x) = 0: 

• The usual way to apply the variational principle is to propose some physically 
motivated form for the quantity of interest x which may depend on a number of 
variational parameters A = (Ai, The optimal x is then found by varying A 
to find a set of parameters Ao that satisfies dQ.{x{Xo))/dX = 0. This yields the 
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FIGURE 1. The grand potential CI is given as a function of some quantity x (schematic). At the physical 
value xq the grand potential is at a minimum n(xo) corresponding to the physical value for D.. Close to xq 
the function i2(x) is quadratic to a good approximation (black dotted line). Any approximate value xq + Ax 
sufficiently close to xq therefore provides an approximate grand potential with an error AO. oc (blue 
dashed lines). 

approximation x{Xo) to the exact xq. As there is not necessarily a small parameter 
involved, this way of constructing approximations is essentially non-perturbative. 
This also means, however, that the ansatz x{k) has to be justified very carefully. 

• The variational procedure not only yields an approximation for xq but also for the 
grand potential Q.. As is obvious from Fig. 1, if the approximate x{Xq) = + Ax 
is sufficiently close to the exact or physical value xq, i.e. if /Sx is sufficiently small, 
then the error in the grand potential is of second order only, AQ oc Ax^. 

• From the approximate grand potential one can derive, by differentiation with re- 
spect to parameters of the Hamiltonian, an in principle arbitrary set of physical 
quantities comprising thermal expectation values but also time-dependent corre- 
lation functions via higher-order derivatives. As a rule of thumb, the higher the 
derivative the more accurate must be the approximate grand potential to get reli- 
able estimates. The fact that an approximate but explicit form for a thermodynam- 
ical potential is available, ensures that all quantities are derived consistently. E.g. 
the thermodynamical Maxwell's relations are fulfilled by construction. 

• An approximation based on a variational principle can in most cases be general- 
ized systematically. One simply has to allow for more variational parameters in 
the ansatz x = x{X). There is a clear tradeoff between the accuracy of the approx- 
imation on the one hand and the necessary computational effort to evaluate the 
resulting Euler equation dQ.{x{XQ))/dX = on the other hand when increasing 
the parameter space. 

• If the grand potential is at a (global) minimum for the physical value xq, then any 
approximation yields an upper bound to the physical Q.{xq). This is an extremely 
helpful property since it allows to judge on the relative quality of an approximation 
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(i.e. as compared to another one). However, not all variational principles are min- 
imum principles since usually x is a multicomponent quantity. Then, 5Q.{xo) = 
merely means that the grand potential is stationary at xq but it is not necessarily at a 
minimum (or maximum). As will be seen below, the static principle is a minimum 
principle while the dynamical variational principle is not. 



3. STATIC VARIATIONAL PRINCIPLE 

To derive the static (generalized Ritz) variational principle, we will first compute the 
static response of an observable to a small static perturbation. This will be used to prove 
the concavity of the grand potential which is necessary to derive the desired minimum 
principle. 



3.1. Static response 

The grand potential of the system with Hamiltonian H{t,U) at temperature T and 
chemical potential jU is given by Q-t^u = —TXnZt^u where 

^t,u = irpt,u (4) 

is the partition function and 

p,,t/ = exp(-/3 C/)-AiA^)) (5) 

the equilibrium density operator and /3 = l/T. The dependence of the partition function 
(and of other quantities discussed below) on the parameters t and U is frequently made 
explicit through the subscripts. 

Let A be a (one-particle or an interaction) parameter of the Hamiltonian that couples 
linearly to the observable A. We furthermore assume that the "physical" Hamiltonian 
is obtained for A = 1, i.e. H{t,U) = H{X = 1) where H{X) = H{t,U) - A + AA = 
Hq + XA. a straightforward calculation then yields 

Note H and A do not necessarily commute and that the physical value of the expectation 
value is obtained by setting A = 1 . 

The computation of the second derivative is a bit more involved but also straightfor- 
ward. We have: 



^A2 dX 
Using dZ/dX = -/3Z(A), this yields 



/3(A)^ + -tr(A^e-^(^(^)-^^) ) . (8) 
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The derivative can be performed with the help of the Trotter decomposition: 

Writing X = e^^Me-mi^^-^^^^ for short, 

trA— = trA-^ lim X'" = lim trA Y X'^'-' ^—X'-^ . (10) 

OA OA m— ra^>oo '-^ (J A 

r=\ 



With dXjdX = -^AX we get: 

^J.J^^^-|i{Ho+XA-^iN) ^ ^-j^ y P_j^x"'-'-AX'- . (1 1) 

3 in — ^oo I'M 



In the continuum limit m — )■ oo we define T = r/3/m e [0, j8]. Hence = j5 /m, and 

trA^r^'"'^^^"^^^ = -Z Jt(A(t)A(0)) (12) 

Jo 

with the Heisenberg representation 

A(t) = e(^/-MA')t^g-(//-MA')t (13) 

for imaginary time t = —it where < T < /3. Collecting the results, we finally find: 

^ = /3 (A)2 - Jr(A(T)A(0)) = ^ . (14) 

Physically, this is the response of the grand-canonical expectation value of the observ- 
able A subjected to a small static external perturbation dX . 

Here the result can be used to show that the grand potential Q is a concave function 
of A: 

^<0. (15) 
This is seen as follows: With AA = A — (A) we have 

|^ = -//^T((A-(A))(T)(A-(A))(0)) (16) 

Using the definition of the quantum-statistical average and A —A\ this implies 
d^a rl^ . 1 



Hence the grand potential is a concave function of any parameter X that linearly enters 
the Hamiltonian. 
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3.2. Generalized Ritz principle 



To set up the famous Ritz variational principle, we define 

This represents the energy of the quantum systems as a functional of the state vector. 
The functional parametrically depends on t and U . The Ritz variational principle then 
states that the functional is at a (global) minimum for the ground state 1^0(^5^^)) of the 
system: 

£,,t/[|^o(?,t/))]=min. (19) 

The proof is straightforward and can be found in standard textbooks on quantum me- 
chanics. 

In the following we will generalize this principle to cover systems in thermal equi- 
librium with a heat bath at finite temperature T and refer to this as the generalized Ritz 
principle. The classical version of the generalized principle goes back to Gibbs [29] and 
was lateron proven for quantum systems by von Neumann and Feynman [30, 31, 32]. 

Let us first define a functional which gives the grand potential of the system in terms 
of the density operator: 

a,^u[p]=^^[p{Ht^u-^N + T\np)^ . (20) 

Again, the functional parametrically depends on t and U as made explicit by the sub- 
scripts and on T and /i (this dependence is suppressed in the notations). The generalized 
Ritz principle then states that the grand potential is at a (global) minimum, 

t2,,[/[p] =min, (21) 

for the exact (the "physical") density operator of the system, i.e. for 

exp(-p(B,„-MW)) 

and that, if evaluated at the physical density operator, yields the physical value for the 
grand potential: 

^tAPt.u] = ^t,u = -nntrexp(-/3(//^,t/ - AiiV)) . (23) 

For the proof, we first note that the latter is satisfied immediately when inserting Eq. 
(22) into Eq. (20). Hence, it remains to show that {/[p] > Q.t^u for "arbitrary" p. The 
argument of the functional, however, should represent a physically meaningful density 
operator, i.e. p shall be normalized (trp = 1), positive definite (p > 0) and Hermitian 
(P = P^)- 

To get a sufficiently general ansatz, we introduce the concept of a reference system. 
This is an auxiliary system with a Hamiltonian 

H' = H{t\ U') = Ho{t') + Hi {U') (24) 
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that has the same structure as the Hamihonian of the original model H{t,U) but with 
different one-particle and interaction parameters. The only purpose of the reference 
system is to span a space of trial density operators 



^ = {p,, j;/ 1 U' arbitrary} 



(25) 



which are given as the exact density operators of the reference system when varying the 
parameters t' and U' . Hence, a trial p is given by 



P = Pt'.u' 



exp(-/3(//,,t/'-MA^)) 



(26) 



Eq. (25) defines the domain Si of the functional Eq. (20). Note that the physical ptxj G S. 
Inserting Eq. (26) in Eq. (20) we get: 



where the expectation value is done with respect to the reference system. 
Now, consider the following the partition: 

We have H{0) = Hf, and //(I) = and with 

^{X) = -rintrexp(-/3(//(A)-AtA^)) 



(27) 



(28) 



(29) 



we get Q(0) = jjf and Q(l) = Q.t,u- The first term on the r.h.s. of Eq. (27) represents 
the expectation value of a Hermitian operator that couples linearly via A to the Hamilto- 
nian H{X), see Eq. (28). Using Eq. (6) we can therefore immediately write Eq. (27) in 
the form 

^t,u [Pt',u'] = A = 0) + . (30) 

1=0 

On the other hand, Q.{X) is a concave function of X, as has been shown in the preceding 
section. Since any concave function is smaller than its linear approximation in some 
fixed point, e.g. in A = 0, we have: 



d^{X) 



dX 



■ X > n{X) 



(31) 



1=0 



Evaluating this relation for X = I and using Eq. (30), yields 

dQ{X) 



dX 



> a{i) = at,u- 



(32) 



1=0 



This proves the validity of the generalized Ritz principle. 
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4. USING THE RITZ PRINCIPLE TO CONSTRUCT 
APPROXIMATIONS 



The standard application of the (generalized) Ritz principle is to construct the static 
mean-field approximation. This represents the well-known Hartree-Fock approximation 
but generalized to systems at finite temperatures. 



4.1. Variational construction of static mean-field theory 

The general scheme to define variational approximations which can be evaluated in 
practice is to start from the variational principle 8Q.t,u [p] = and to insert an ansatz for 
p for which the functional can be evaluated exactly. To this end, one has to restrict the 
domain of the functional: 

pG^'c^. (33) 

Usually, this is necessary since the grand potential and the expectation value on the r.h.s. 
of Eq. (27) are not available for interacting systems. A restriction of the domain of the 
functional is equivalent with a restriction of the reference system, i.e. with a restricted 
set of parameters t' and U' . Any choice for ^' results in a particular approximation. 
Static mean-field theory emerges for the reference system 

= 14/34^/3 (34) 

ap 

where the interaction term is dropped, U' = 0, but where all one-particle parameters t' 
are considered as variational parameters. This is an auxiliary system of non-interacting 
electrons. The corresponding restricted domain is: 

^' = {Pt',0 1 1' arbitrary} C ^ . (35) 

Hence, static mean-field theory aims at the optimal independent-electron density opera- 
tor to describe an interacting system. In the following we write 

Pt'^Pt',u^o = ^e-P^"'-^'^ (36) 

and (■■■)' = tr(pj/ ■ ■ ■) for short. Our goal is to determine the optimal set of variational 
parameters t' from the conditional equation 

^-at,u[Pt']=0. (37) 



To start with, we note 



^t,u[Pt']=i^{pAHt,u-^N + T\npt,)'^ . (38) 
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Inserting the trial density operator of the non-interacting reference system, we find: 

QtM = {Ht,u-^Ny-{Ht,^o-^Ny + Qt,^o (39) 

or 

The dependence on the variational parameters t'^y is twofold: There is an explicit depen- 
dence that is obvious in the third and the fourth term on the r.h.s. and there is an addi- 
tional implicit dependence via the expectation value (■■■)'. To calculate the derivative in 
Eq. (37), we first note that according to Eq. (6) dQ.t'fi/dt'^y = (cJiCv)'- Furthermore, we 
define (see Eq. (14)): 

Kv,P = ^^i^ = UcicpYiclcyY- JT(4(T)c^(T)4cv)'. (41) 

Therewith, Eq. (37) reads: 

l^^a;6^av/x/3 + 9 ^apSy-TJ- {(^a^^l^^r^sY -Y^ap^av^p = . (42) 
a/3 ^ aPyS ^'^v ap 

At this point we can make use of the fact that the reference system is given by a 
Hamiltonian that is bilinear in the creators and annihilators. In this case Wick's theorem 
(see e.g. Ref. [1, 2, 3]) applies: Any 2A''-point correlation function consisting of A'^ 
creators and N annihilators in an expectation value with respect to a bilinear Hamiltonian 
can be simplified and written as the sum over all different full contractions. Here, a 
full contraction is a distinct factorization of the 2A/^-point correlation function into a 
product of A'^ two-point correlation functions. Usually, Wick's theorem is formulated for 
time-ordered product of creators and annihilators, and time ordering produces, in the 
case of fermions, a minus sign for each transposition of these operators. For the static 
expectation value encountered here, we only have to consider a creator to be "later" than 
an annihilator, to realize that in this sense the expectation value in the second term on 
the r.h.s. is already time ordered and to take care of the minus sign when ordering the 
resulting two-point correlation functions. We find: 

{cWpCyCsY = {cic§Y{c'pCyY - {clcCyYic^CsY ■ (43) 

The result can also be derived in a more direct (but less elegant) way without using 
Wick's theorem, of course. Using this and rearranging terms, we have: 

Y Uap5y{cWpCyC§Y = ^ {UapSy - Uapys) {cic§Y {c^^CyY . (44) 
aPyS apyd 

We carry out the differentiation: 

-4r- L ^apdyiciclcyCsY 
^'/iv apyS 
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E {UapSY-UapYs) [{clcsYK'p^^^ + K'^^^gicyyYj (45) 



and again rearrange terms to get: 



and thus 



where we made use of Uapgy = UpayS - Collecting the results, we have 

T,^al5K'av^ip-H^apKv^ip+ H {U ayp8 - Uyafis) {c\c8)' K'^^^^ = Q (48) 



or 



«j8 V y5 / 

Assuming that K can be inverted, this implies 

= ^afi+Y^i^aypS-Uyaps) {clyCs)' ■ (50) 
y5 

Hence, the optimal one-particle Hamiltonian of the reference system reads: 

^'=i(^«/3+<70'«'^ ^^^^ 

where 

^LT^ = E (^ayjSS - f^yajSs) (CyC^)' (52) 
y5 

is the (frequency-independent) Hartree-Fock self-energy. Note that the self-energy has to 
be determined self-consistently: Starting with a guess for E*-^^), we can fix the reference 
system's Hamiltonian H' . The two-point correlation function of the reference system 
{caCp)' is then easily calculated by a unitary transformation of the one-particle basis set 
a*-^ k such that the correlation function becomes diagonal, {c^cj^i)' <^ 5j^j^, and by using 
Fermi gas theory to get {clck)' from the Fermi-Dirac distribution and, finally, by back- 
transformation to find (caCp)'. With this, a new update of the Hartree-Fock self-energy 
is obtained from Eq. (52). 
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The first term in Eq. (52) is the so-called Hartree potential. It can be interpreted 
classically as the electrostatic potential of the charge density distribution resulting from 
the A'^ electrons of the system. Opposed to the first term, the second one is spatially non- 
local if written in real-space representation. This is the Fock potential produced by the 
A'' electrons and has no classical analogue. Note that there is no self-interaction of an 
electron with the potential generated by itself: Within the real-space representation, the 
corresponding Hartree and Fock terms are seen to cancel each other exactly. 



4.2. Grand potential within static mean-field theory 

The final task is to compute the grand potential for the optimal (Hartree-Fock) density 
operator, i.e. 

a^^J^ = QtviPt'] = ir(p,,{Ht^u-^N + T\np,,)) (53) 
where (the self-consistent) t' is taken from Eq. (50). We find: 

aj8 ajidy afi 

Using Wick's theorem, 

(HF 



apyS a/3 
and inserting the optimal t'^^ = t^^ +L75 {UayfiS - Uyaps) ('^7^5)'. we arrive at: 

^^"^^=^''.0 + ^ L (f^a/35y-C^a/3y5) (4c-5)'(4^r)' 
a/375 

- Y i^aypS-Uyaps) {cl^Sy{cl,Cjiy . (56) 
ajiyS 

With the substitution (a/37) ~^ (/^70f) i^i the second term, this yields 

af'J^=a,^0-\ I Kpsy-Uapys) (4c5)'(4c7)' = ^^"J^^ - ^ ^E^f/) (4c/3)' • 
ajiyS aji 

(57) 

Using Wick's theorem "inversely", 

^!^f ^ = - ^ E ^a/35y(44w)'- (58) 
apyd 
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This is an interesting result as it shows that the Hartree-Fock grand potential is different 
from the grand potential of the reference system q which is the grand potential 
of a system of non-interacting electrons. Due to the "renormalization" of the one- 
particle parameters t — )■ the grand potential of the reference system q does already 
include some interaction effects. As Eq. (58) shows, however, there is a certain amount 
of "double counting" of interactions in ^t'fi which has to be corrected for by the 
second term. The second term is the Coulomb interaction energy of the electrons in 
the renormalized one-particle potential t' and lowers the Hartree-Fock grand potential. 

fHF) 

This is important as we know that Q.^ ^ must represent an upper bound to the exact 
grand potential of the system: 

E Uafi8Y'\^^ac\cyC^)' >Q.t,u ■ (59) 

Concluding, we can state that Hartree-Fock theory can very easily be derived from 
the generalized Ritz principle. The only approximation consists in the choice of the 
reference system which serves to span a set of trial density operators. The rest of the 
calculation is straightforward and provides consistent results. 

To estimate the quality of the approximation, we replace U^^^y ^U^^^y and 
expand the exact grand potential in powers of the interaction strength X: 

^t,u = ^tfl + A + ff{X^) . (60) 

Using Eq. (6), this gives 

^t,u = ^t,o + :^\ E Uafi5j{ciclcyC8)' + ff{X^), (61) 

where we have replaced the expectation value with respect to the non-interacting 
system H{t,0) by the one with respect to the Hartree-Fock reference system 
//(?', 0) where t' is the self-consistent one-particle potential. Since t^^p ~ ^'ap ~ 

"LyS {^ayjSd ~ Uyaps) i'^y'^s)' = this is correct up to terms of order ^{X^). With 
the same argument we can treat the first term on the r.h.s. yielding: 

at,U=^t',0 + Y,irap-t'ap)^^ + ^{^^)+^^ £ UapSyiciclcyCsY + ^iX^). 
a/3 "^f^P ^ aPyd 

(62) 

Using Eq. (6) once more, dClt'^/dtap = {caCp)', it follows 

^t,U = ^t'fi + Y,(^ap-t'ap){clcCpy +^ UapSy{cWp(^r^Sy+^{^^)- (63) 
aj8 apyS 

where we have set A = 1 in the end. Comparing with Eq. (54), this shows that 

^t,U = - ^ E U^pSyicUlcyCsY + ff{X^) (64) 
aPyd 
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or 

^',i/ = ^T + ^(^')- (65) 
Static mean-field theory thus predicts the correct grand potential of the interacting 
electron system up to first order in the interaction strength. It is easy to see, however, that 
already at the second order there are deviations. In fact, the diagrammatic perturbation 
theory shows that static mean-field (Hartree-Fock) theory is fully equivalent with self- 
consistent first-order perturbation theory only. This leads us to the conclusion that 
despite its conceptual beauty, static mean-field theory must be expected to give poor 
results if applied to a system of strongly correlated electrons. 



4.3. Approximation schemes 

Let us try to learn from the presented construction of static mean-field theory using 
the generalized Ritz principle and pinpoint the main concepts such that these can be 
transferred to another variational principle which might then lead to more reliable 
approximations. We consider a variational principle of the form 

dht,u[x]=0, (66) 

where x is some unspecified multicomponent physical quantity. It is assumed that the 
functional t/[jc] is stationary at the physical value for jc = Xt u and, if evaluated at 
the physical value, yields the physical grand potential Q.t u- Typically and as we have 
seen for the Ritz principle, it is generally impossible to exactly evaluate the functional 
for a given x and that one has to resort to approximations. Three different types of 
approximation strategies may be distinguished, see also Fig. 2: 

In a type-I approximation one derives the Euler equation 5£lt^u[A/ = ^ first and 
then chooses (a physically motivated) simplification of the equation afterwards to render 
the determination of Xt,u possible. This is most general but also questionable a priori, 
as normally the approximated Euler equation no longer derives from some approximate 
functional. This may result in thermodynamical inconsistencies. 

A type-II approximation modifies the form of the functional dependence, Q.f ij [••■]—!■ 

^!'^[' ■ ■]' '^^ S^*^ ^ simpler one that allows for a solution of the resulting Euler equation 

5^^^'li[x]/ dx = 0. This type is more particular and yields a thermodynamical potential 
consistent with Xt^u- Generally, however, it is not easy to find a sensible approximation 
of a functional form. 

Finally, in a type-Ill approximation one restricts the domain of the functional which 
must then be defined precisely. This type is most specific and, from a conceptual point 
of view, should be preferred as compared to type-I or type-II approximations as the 
exact functional form is retained. In addition to conceptual clarity and thermodynamical 
consistency, type-Ill approximations are truly systematic since improvements can be 
obtained by an according extension of the domain. 

Examples for the different cases can be found e.g. in Ref. [33]. The presented deriva- 
tion of the Hartree-Fock approximation shows that this type-Ill. The classification of ap- 
proximation schemes is hierarchical: Any type-Ill approximation can also be understood 
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FIGURE 2. Hierarchical relation between different types of approximations that can be employed to 
make use of a formally exact variational principle of the form 5n,,[/[jc] = 0. Type I: approximate the 
Euler equation. Type II: simplify the functional form. Type III: restrict the domain of the functional. 



as a type-II one, and any type-II approximations as type-I, but not vice versa (see Fig. 
2). This does not mean, however, that type-III approximations are superior as compared 
to type-II and type-I ones. They are conceptually more appealing but do not necessarily 
provide "better" results. 

5. DYNAMICAL QUANTITIES 

To set up the dynamical variational principle, some preparations are necessary. We first 
introduce the one-particle Green's function and the self-energy, briefly sketch diagram- 
matic perturbation theory and also discuss how within the framework of perturbation 
theory it is possible to construct non-perturbative approximations like the so-called clus- 
ter perturbation theory (CPT). 

5.1. Green's functions 

We again consider a system of interacting electrons in thermodynamical equilibrium 
at temperature T and chemical potential /i. The Hamiltonian of the system is H = 
H{t,U), see Eq. (1). Now, the one-particle Green's function [1, 2, 3] 

Gafi{co) = {{ca\cl)) (67) 

of the system will be the main object of interest. This is a frequency-dependent quantity 
which provides information on the static expectation value of the one-particle density 
matrix CaC^g but also on the spectrum of one-particle excitations related to a photoemis- 
sion experiment [34]. The Green's function can be defined for complex ft) via its spectral 
representation: 

Ga/3(«)= r dz^^^, (68) 

where the spectral density 

/oo 
dt exp{izt)Aap{t) (69) 
-oo 
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is the Fourier transform of 



Aap{t) = ^{[ca{t),cl{0)]+), (70) 

which involves the anticommutator of an annihilator and a creator with a Heisenberg 
time dependence 0{t) = exp{i{H — iiN)t)Oexp{—i{H — IJ-N)t). Due to the thermal 
average, (O) = tr{pt^uO)/Zt_ij, the Green's function parametrically depends on t and 
U and is denoted by Gt y ap (f^)- 

For the diagram technique employed below, we need the Green's function on the 
imaginary Matsubara frequencies (O = i(0„ = i{2n + l)7i:T with integer n [1, 2, 3]. In 
the following the elements Gf u ^^{icOn) are considered to form a matrix Gt.u which is 
diagonal with respect to n. 

The "free" Green's function G,^o is obtained for U = Q, and its elements are given by: 

This is a result that can easily be derived from the equation-of- motion technique [2]. 
Therewith, we can define the self-energy via Dyson's equation 

Gtu = \ , (72) 

i.e. llt.u = Gj'o — ^7u- ^^^^ meaning of this definition becomes clear within the 
context of diagrammatic perturbation theory [1, 2, 3]. 



5.2. Diagrammatic perturbation theory 

The main reason why the Green's function is put into the focus of the theory is that a 
systematic expansion in powers of the interaction can be set up. Here, a brief sketch of 
perturbation theory can be given only (see Refs. [1, 2, 3] for details). Starting point is 
the so-called S-matrix defined for < T, t' < /3 as 

S{T,T') = e''^^'e-^^'-'\-^^'\ (73) 

where = Ht^u - and JTq = Ht^ - fiN. 

There are two main purposes of the S-matrix. First, it serves to rewrite the partition 
function in the following way: 

Z,,[; = tre-^'^ = tr(e-^^'e^^'e-^'^) = tr (e-^'^"5(/3,0)) =Z,,o(S(/3,0))(°) . (74) 

The partition function of the interacting system is thereby given in terms of the partition 
function of the free system, which is known, and a free expectation value of the S-matrix. 
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The second main purpose is related to the time dependence. The Green's function on 
the Matsubara frequencies G^^g (?&)„) is related to the imaginary-time Green's function 
Gap (t) via discrete Fourier transformation: 

1 °° 

Gap(^) = -^ E Gap{i(On)e-'^'' (75) 

and 

GaniiOJn) = dTGap{T)e"'"\ (76) 

Let ^ be the (imaginary) time-ordering operator. Then 

Gap{T) = -{^Cair)clm (77) 

is given in terms of an annihilator or creator which has the full (imaginary) Heisenberg 
time dependence: 

Ca{T) = e^'cae-'^\ (78) 

With the help of the S-matrix the interacting time dependence can be transformed into a 
free time dependence, namely: 

c«(T) =S(0,T)c/,a(T)5(T,0) , 4(T) =S(0,t)c;,,(t)5(t,0) . (79) 

Here, the index / ("interaction picture") indicates that the time dependence is due to 
only. This time dependence is simple and known: 

Therefore, the remaining problem consists in the determination of the S-matrix itself. 
The defining equation Eq. (73) cannot be used since this involves the exponential of the 
fully interacting Hamiltonian. There is, however, a much more suitable representation 
of the S-matrix. To derive this, we first set up its equation of motion. A straightforward 
calculation shows: 

-|-5(t,tO=^i./(t)S(t,tO- (81) 

Here, H\ = H — Hq h the interaction part of the Hamiltonian and in H\ i{x) the time 
dependence is due to Hq only. A formal solution of this differential equation with the 
initial condition 5(t, t) = 1 can be derived easily using the time-ordering operator ^ 
again: 

5(t, x') = ^exp Jt'V/(t")^ • (82) 

Note, that if all quantities were commuting, the solution of Eq. (81) would trivially be 
given by Eq. (82) without The appearance of 3^ can therefore be understood as 
necessary to enforce commutativity. 
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Using this S-matrix representation, the partition function and the Green's function can 
be written as: 



The important point is that the expectation values and time dependencies appearing here 
are free and thus known. 

Expanding the respective exponentials in Eq. (83) and Eq. (84) provides an expansion 
of the static partition function and of the dynamic Green's function in powers of the 
interaction strength. The coefficients of this expansion are given as free expectation 
values of time-ordered products of annihilators and creators with free time dependence. 
Hence, according to Wick's theorem the coefficient at the k-th order is given by a sum of 
{2k) ! terms (in case of the partition function, for example), each of which factorizes into 

an A;- fold product of terms of the form (5^Ca(T)c|j^,(T'))*^^) called contractions. Apart 
from a sign, this contraction is nothing but the free Green's function. The summation 
of the {2k) ! terms is organized by means of a diagrammatic technique where vertices 
(U) are linked via propagators (G/ o)- The details of this technique can be found in Refs. 
[1, 2, 3], for example. 

There are three important points, however, which are worthwhile to be mentioned 
here: The first is the linked-cluster theorem which allows us to concentrate on connected 
diagrams only. In the case of the Green's function, the disconnected diagrams exactly 
cancel the denominator in Eq. (84). As concerns closed diagrams contributing to the 
partition function, Eq. (83), the sum of only the connected closed diagrams yields, apart 
from a constant, InZ, {/, i.e. the grand potential. 

Second, we can identify so-called self-energy insertions in the diagrammatic series, 
i.e. parts of diagrams with exhibit links to two external propagators. Examples are given 
in Fig. 3A where we also distinguish between reducible and irreducible self-energy 
insertions. The reducible ones can be split into two disconnected parts by removal 
of a single propagator line. The self-energy is then defined diagrammatically as the 
sum over all irreducible self-energy insertions, and Dyson's equation can be derived 
diagrammatically, see Fig. 3B. 

Third, there are obviously irreducible self-energy diagrams which contain self-energy 
insertions, see the first diagram in Fig. 3C, for example. Self-energy diagrams with- 
out any self-energy insertion are called skeleton diagrams. Skeleton diagrams can be 
"dressed" by replacing in the diagram the propagators which stand for the free Green's 
function with "full" propagators (double lines) which stand for the fully interacting 
Green's function, see Fig. 3D. It is easy to see that the self-energy is given by the sum 
of the skeleton diagrams only provided that these are dressed, see Fig. 3E. Therewith, 
the self-energy is given in terms of the interacting Green's function. The corresponding 
functional relationship is called skeleton-diagram expansion. 




(83) 



and 



Ga^{l) 




(84) 
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FIGURE 3. (A) A reducible and two irreducible self-energy insertions. Vertices (U) are represented 
by dashed lines, propagators (Gt.o) by solid lines. (B) Diagrammatic representation of Dyson's equation. 
The fully interacting Green's function is symbolized by a double line, the self-energy by a semi-circle. 
(C) Self-energy diagrams with (left) and without (right) self-energy insertions. (D) A skeleton self-energy 
diagram (left) and a dressed skeleton (right). (E) The skeleton-diagram expansion of the self-energy. 



Some more important properties of the self-energy can be listed: (i) The self-energy 
has a spectral representation similar to Eq. (68). (ii) In particular, the corresponding 
spectral function (matrix) is positive definite, and the poles of T.t,u on the real axis 
[35]. (iii) Ec^^(£o) = if a or /3 refer to one-particle orbitals that are non-interacting, 
i.e. if a or /3 do not occur as an entry of the matrix of interaction parameters U. Those 
orbitals or sites are called non-interacting. This property of the self-energy is clear from 
its diagrammatic representation, (iv) If a refers to the sites of a Hubbard-type model 
with local interaction, the self-energy can generally be assumed to be more local than the 
Green's function. This is corroborated e.g. by explicit calculations using weak-coupling 
perturbation theory [36, 37, 38] and by the fact that the self-energy is purely local on 
infinite-dimensional lattices [22, 39]. 



5.3. Cluster perturbation theory 

The method of Green's functions and diagrammatic perturbation theory represents 
a powerful approach to study systems of interacting electrons in thermal equilibrium. 
At first sight it seems that only weakly interacting systems are accessible with this tech- 
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FIGURE 4. To derive the CPT, consider perturbation theory in U and the inter-cluster hopping V. (A) 
The exact Green's function would be obtained by starting from the free (U = V = 0) propagator G'q and, 
in a first step, sum the diagrams of all orders in V but for U = 0. Since the diagrammatic series is a 
geometrical series only, this can be done exactly. Formally, this corresponds to usual scattering theory 
for free electrons and yields the U ^0 Green's function Gq. In a second step, the diagrams generated by 
electron-electron interaction U would have to be summed up to get the full G. This, however, cannot be 
done in practice. (B) The CPT approximation reverses the two above steps. First, the free (U — V — 0) 
propagator Gq is renormalized by electron-electron interaction U to all orders but at V = 0. This yields the 
fully interacting cluster Green's function G' . While, of course, G' cannot be computed by the extremely 
complicated summation of individual U diagrams, it is easily accessible numerically if the cluster size is 
sufficiently small. In the second step, the V = propagator G' is renormalized due to potential scattering. 
This yields an approximation Gcpt to the exact Green's function G. (C) The CPT treatment is approximate 
since certain diagrams are neglected. The last panel shows a self-energy diagram, second order in U, 
second order in V, which is not taken into account within CPT. 



nique. In fact, summing only the diagrams up to a given order in the expansion represents 
a strict weak-coupling approach. For prototypical lattice models with local interaction, 
such as the Hubbard model, however, an expansion in powers of the hopping appears to 
be more attractive since most of the interesting phenomena, like collective magnetism 
or correlation-driven metal-insulator transitions emerge in the strong-coupling regime. 
Let us therefore focus on the Hubbard model in the atomic limit for a moment: 

^at = Y,^iic](yCia + Y ^«,cT«/-(7 • (85) 
ia ia 

Here, the problem separates into atomic problems which can be solved easily since the 
atomic Hilbert space is small. It is therefore tempting to start from the atomic limit 
and to treat the rest of the problem, the "embedding" of the atom into the lattice, in 
some approximate way. The main idea of the so-called Hubbard-I approximation [19] 
is to calculate the one-electron Green's function from the Dyson equation where the 
self-energy is approximated by the self-energy of the atomic system. This is one of the 
most simple embedding procedures. It already shows that the language of diagrammatic 
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perturbation theory, Green's functions and diagrammatic objects, such as the self-energy, 
can be very helpful to construct an embedding scheme. 

The Hubbard-I approach turns out to be a too crude approximation to describe the 
above-mentioned collective phenomena. One of its advantages, however, is that it offers 
a perspective for systematic improvement: Nothing prevents us to start with a more com- 
plicated "atom" and employ the same trick: We consider a partition of the underlying 
lattice with L sites (where L — 0°) into L/Lc disconnected clusters consisting of Lc sites 
each. If Lc is not too large, the self-energy of a single Hubbard cluster is accessible by 
standard numerical means [40] and can be used as an approximation in the Dyson equa- 
tion to get the Green's function of the full model. This leads to the cluster perturbation 
theory (CPT) [41,42]. 

CPT can also be motivated by treating the Hubbard interaction U and the inter- 
cluster hopping V as a perturbation of the system of disconnected clusters with intra- 
cluster hopping t' . This has been shown recently [43]. The CPT Green's function is then 
obtained by summing the diagrams in perturbation theory to all orders in U and V but 
neglecting vertex corrections which intermix U and V interactions, see Fig. 4. 

While these two ways of deriving CPT are equivalent, one aspect of the former is in- 
teresting: Taking the self-energy from some reference model (the cluster) is reminiscent 
of dynamical mean-field theory (DMFT) [22, 25, 26] where the self-energy of an impu- 
rity model approximates the self-energy of the lattice model. This provokes the question 
whether both, the CPT and the DMFT, can be understood in single unifying theoretical 
framework. 

This question is another motivation for the construction of approximations from a 
dynamical variational principle. Yet another one is that there are certain deficiencies of 
the CPT. While CPT can be seen as a cluster mean-field approach since correlations be- 
yond the cluster extensions are neglected, it not self-consistent, i.e. there is no feedback 
of the resulting Green's function on the cluster to be embedded (some ad hoc element 
of self-consistency is included in the original Hubbard-I approximation). In particular, 
there is no concept of a Weiss mean field as it is the case within static mean-field the- 
ory, and, therefore, CPT can neither describe different phases of an extended system nor 
phase transitions. Another related point is that CPT provides the Green's function only 
but no thermodynamical potential. Different ways to derive e.g. the free energy from the 
Green's function [1, 2, 3] give inconsistent results. 

6. SELF-ENERGY-FUNCTIONAL THEORY 

To overcome these deficiencies, a self-consistent cluster-embedding scheme has to be 
set up. Ideally, this results from a variational principle for a general thermodynamical 
potential which is formulated in terms of dynamical quantities as e.g. the self-energy 
or the Green's function. The variational formulation should ensure the internal consis- 
tency of corresponding approximations and should make contact with the DMFT. These 
points represent the goals self-energy-functional theory [44, 45, 46, 33] which will be 
developed in the following. 
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6.1. Luttinger-Ward generating functional 



First of all, we would like to distinguish between dynamic quantities, like the self- 
energy, which is frequency-dependent and related to the (one-particle) excitation spec- 
trum, on the one hand, and static quantities, like the grand potential and its derivatives 
with respect to jl, T, etc. which are related to the thermodynamics, on the other. A link 
between static and dynamic quantities is needed to set up a variational principle which 
gives the (dynamic) self-energy by requiring a (static) thermodynamical potential be sta- 
tionary. There are several such relations [1, 2, 3]. The Luttinger-Ward functional 4>[/[G] 
provides a special relation with several advantageous properties [9]: 

(i) ^t/[G] is a functional. Functionals A = A[- ■ ■] are indicated by a hat and should 
be distinguished clearly from physical quantities A. 



(ii) The domain of the Luttinger-Ward functional is given by "the space of Green's 
functions". This has to be made more precise later. 



(iii) If evaluated at the exact (physical) Green's function, G/,{/, of the system with 
Hamiltonian H — H{t,U), the Luttinger-Ward functional gives a quantity 

^u[Gtjj]=^t,u (86) 
which is related to the grand potential of the system via: 

^t,u = +TrlnG,,t/ -Tr(E,,i/G,,i/) . (87) 

Here the notation 

TrA = r££e''*^o^A««(/(0.) (88) 

n a 

is used where 0^ is a positive infinitesimal. 



(iv) The functional derivative of the Luttinger-Ward functional with respect to its 
argument is: 

J,M.£„[CJ. (89, 

Clearly, the result of this operation is a functional of the Green's function again. 
This functional is denoted by E since its evaluation at the physical (exact) Green's 
function Gt^u yields the physical self-energy: 

E[G,,f/]=E,,f/. (90) 

(v) The Luttinger-Ward functional is "universal": The functional relation ■ ■] 
is completely determined by the interaction parameters U (and does not depend on 
t). This is made explicit by the subscript. Two systems (at the same chemical po- 
tential /i and temperature T) with the same interaction U but different one-particle 
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parameters t (on-site energies and hopping integrals) are described by the same 
Luttinger-Ward functional. Using Eq. (89), this implies that the functional Et/[G] 
is universal, too. 

(vi) Finally, the Luttinger-Ward functional vanishes in the non-interacting limit: 

$t/[G]=0 for l/ = 0. (91) 



6.2. Diagrammatic derivation 

In the original paper by Luttinger and Ward [9] it is shown that [G] can be con- 
structed order by order in diagrammatic perturbation theory. The functional is obtained 
as the limit of the infinite series of closed renormalized skeleton diagrams, i.e. closed 
diagrams without self-energy insertions and with all free propagators replaced by fully 
interacting ones (see Fig. 5). There is no known case where this skeleton-diagram ex- 
pansion could be summed up to get a closed form for [G] . Therefore, the explicit 
functional dependence is unknown even for the most simple types of interactions like 
the Hubbard interaction. 

Using the classical diagrammatic definition of the Luttinger-Ward functional, the 
properties (i) - (vi) listed in the previous section are easily verified: By construction, 
4>t/[G] is a functional of G which is universal (properties (i), (ii), (v)). Any diagram is 
the series depends on U and on G only. Particularly, it is independent of t. Since there 
is no zeroth-order diagram, ^'t/[G] trivially vanishes for U = 0, this proves (vi). 

Diagrammatically, the functional derivative of 4>[/ [G] with respect to G corresponds 
to the removal of a propagator from each of the 4> diagrams. Taking care of topological 
factors [9, 1], one ends up with the skeleton-diagram expansion of the self-energy (iv), 
see also Fig. 3E. Therefore, Eq. (90) is obtained in the limit of this expansion. 

Eq. (87) can be derived by a coupling-constant integration as has been shown in Ref. 
[9]. Alternatively, it can be verified by integrating over ji. To this end, we first consider 
the jU derivative of the different terms on the r.h.s. of Eq. (87). For the first term we have: 



(92) 



and thus 



^t,u =2^T}_^L,^u.pa{i(0n) ^ = Tr\^t,u-^ j ■ (93) 



For the second one, we find 
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FIGURE 5. Original definition of the Luttinger-Ward functional <t>(/[G], see Ref. [9]. Double lines: 
fully interacting propagator G. Dashed lines: interaction U. 



and for the third: 



G,,ci +Tr E,.t/ 



dGt^u 



(95) 



Next, we note that 

-1 dGt.u 



Tr G 



't.u 



Tr 



1 dGt^u ^ 
dn 



dll 



Tr 



dll 



Gt.u 



-TrGr,j/ 



Gtu 

(96) 



where Dyson's equation has been used in the last step and dG^ q/^II = 1. Collecting 
the results, we have: 



dll 



(4>f,[/+TrlnG,,[/-Tri:,,f/G,,f/) = -TrG^i/ 



From the definition of the Green's function we immediately have 

TxGt^u = {N)t,u ■ 



(97) 



(98) 



{N)t,u is the grand-canonical average of the total particle-number operator. Apart from 
a sign, this can be written as the /i derivative of the grand potential. Hence, 



^($,,[/+TrlnG,,t/-TrE,,f/G,,i/) = 



(99) 



Integration over /i then yields Eq. (87). Note that the equation holds trivially for /i — )■ 
i.e. for {N)t,u since T-t^u = and ^t,u = in this limit. 



6.3. Derivation using the path integral 

For the diagrammatic derivation it has to be assumed that the skeleton-diagram series 
is convergent. It is therefore interesting to see how the Luttinger-Ward functional can 
be defined and how its properties can be verified within a path-integral formulation. 
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This is essentially non-perturbative. The path-integral construction of the Luttinger- 
Ward functional was first given in Ref. [47]. 

Using Grassmann variables [3] ^a{iO)n) = T^^^ Jq^^ dt e'^"'^^a{^) and (^«(zCi),j) = 
T^/^ j^^^ dx e^^^"'^b,^{x), the elements of the Green's function are given by 
Gt,u,apiio)n) = -{U{ia)n)^p{ia)n))t,u- The average 

Gt,uM'^") = D^D^*U{i(0„)^{ico„)exp {A,^u,^^*) (100) 

is defined with the help of the action A, j;^^* = A,^^* +^t/,<^<^* where 

^t,U,^^* = L &G)n){{iCOn+ ^)5ap -tap)^p{i(On) +Ai;^^^. (101) 

and 

^UM* = L ^apsr f ^dx^*M^myms{^) ■ (102) 

This is the standard path-integral representation of the Green's function [3]. 

The action can be considered as the physical action which is obtained when evaluating 
\h& functional 

^u,^^AGq^] = E ^*(/w„)Go;^^(/(O„)^/3(/(o,0 +Ay^^^. (103) 

/!,0!/3 

at the (matrix inverse of the) physical free Green's function, i.e. 

\u,^^*=Av,^^Ag;},]. (104) 

Using this, we define the functional 

^u[GQ'] = -T\nZu[GQ'] (105) 

with 

Zu[G^'] = J D^D^*exp (a^^^^^^Gq i]) . (106) 

The functional dependence of Q.u[Gq^] is determined by U only, i.e. the functional is 
universal. Obviously, the physical grand potential is obtained when inserting the physical 
inverse free Green's function G^q : 

iiu[G;i] = at,u. (107) 

The functional derivative of Eq. (105) leads to another universal functional: 
1 5au[Go'] 1 SZu[Go'] 



-Gi/[Go1, (108) 



T 5G,' Zu[G,'] 5Go^ 
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with the property 

Gu[G;^^] = Gt,u- (109) 

This is easily seen from Eq. (100). 

The strategy to be pursued is the following: G{/[Gq ^] is a universal {t independent) 

functional and can be used to construct a universal relation E = TLu [G] between the self- 
energy and the one-particle Green's function - independent from the Dyson equation 
(72). Using this and the universal functional Q.u[Gq^], a universal functional ^'f/[G] 

can be constructed, the derivative of which essentially yields JLu [G] and that also obeys 
all other properties of the diagrammatically constructed Luttinger-Ward functional. This 
also means that Z^/ [G] corresponds to the skeleton-diagram expansion of the self-energy. 
To start with, consider the equation 

Gu[G^ +^ = G. (110) 

This is a relation between the variables Z and G which for a given G, may be solved for 
E. This defines a functional Et/[G], i.e. 

Gu[G-^-tu[G\] = G. (Ill) 

For a given Green's function G, the self-energy Z = Zj/ [G] is defined to be the solution 
of Eq. (110). From the Dyson equation (72) and Eq. (109) it is obvious that the relation 
(110) is satisfied for the physical Z = Z^ [/ and the physical G = Gt^u of the system with 
Hamiltonian Htjj: 

%u[G,,u]='^t,u- (112) 

This construction simplifies the original presentation in Ref. [47]. The discussion on 
the existence and the uniqueness of possible solutions of the relation (110) given there 
applies accordingly to the present case. 

With the help of the functional Q.u[Gq^] and Zt/[G], the Luttinger-Ward functional 
is obtained as: 

$t/[G] =^if/[G-i+Zt/[G]]-TrlnG + Tr(Zi/[G]G). (113) 
Let us check property (iv). Using Eq. (108) one finds for the derivative of the first term: 

and, using Eq. (Ill), 

With 



5G 1 

^ = -G^jC,l (116) 
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we find: 



^%^ = «^i-I°^r%^4J-(-™nG + Tr(£„[GlG)) (117) 



and thus: 

1 d^u[G\ 



T 5Gap{i(0n) 



^uM^(^n)[G], (118) 



where, as a reminder, the frequency dependence has been reintroduced. 

The other properties are also verified easily, (i) and (ii) are obvious, (iii) follows from 
Eq. (107), Eq. (109) and Eq. (112) and the Dyson equation (72). The universality of 
the Luttinger-Ward functional (v) is ensured by the presented construction. It involves 
universal functional only. Finally, (vi) follows from Gi;=q[G^^] = G which implies (via 
Eq. (Ill)) T.u=o[G] = 0, and with Qu=o[G^^] = TrlnG we get $u=o[G] = 0- 



6.4. Dynamical variational principle 

The functional E^/ [G] can be assumed to be invertible locally provided that the system 
is not at a critical point for a phase transition (see also Ref. [44]). This allows to construct 
the Legendre transform of the Luttinger-Ward functional: 

Ft;[E] =$t;[Gf;[E]]-Tr(EGt;[E]). (119) 

Here, Gu[T.u[G]] = G. With Eq. (118) we immediately find 

1 dFu[L] 
T 5L 

We now define the self-energy functional: 



-Gf/[E]. (120) 



Its functional derivative is easily calculated: 

1 dht^um 1 



a,,f;[E] =Trln— l^+Ft;[E]. (121) 



The equation 

G,/fEl = 

G 



Gu[n = ^:^^ (123) 



is a (highly non-linear) conditional equation for the self-energy of the system H = 
Ho{t) +H\{U). Eqs. (72) and (112) show that it is satisfied by the physical self-energy 
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E = T^t^u- Note that the l.h.s of (123) is independent of t but depends on U (universahty 

of G£/[Z]), while the r.h.s is independent of U but depends on t via G^o- The obvious 
problem of finding a solution of Eq. (123) is that there is no closed form for the 
functional G{/[Z]. Solving Eq. (123) is equivalent to a search for the stationary point 
of the grand potential as a functional of the self-energy: 

—^ = 0. (124) 

This represents the dynamical variational principle and the starting point for self-energy- 
functional theory. It should be mentioned that one can equivalently formulate the prin- 
ciple with the Green's function as the basic variable, i.e. 5Q.t u[G]/5G = (see Refs. 
[48, 49], for examples). One reason to consider self-energy functionals instead of func- 
tionals of the Green's function, however, is to derive the dynamical mean-field theory as 
a type-III approximation. 



6.5. Reference system 

The central idea of self-energy-functional theory is to make use of the universality of 
(the Legendre transform of) the Luttinger-Ward functional to construct type-III approxi- 
mations. Consider the self-energy functional Eq. (121). Its first part consists of a simple 
explicit functional of E while its second part, i.e. i^t/p], is unknown but depends on U 
only. 

Due to this universality of Fu [E] , one has 

^i,,,t/[E] =Trln— +Fc,[E] (125) 

for the self-energy functional of a so-called "reference system". As compared to the 
original system of interest, the reference system is given by a Hamiltonian H' = 
Ho{t') +Hi (U) with the same interaction part U but modified one-particle parameters t'. 
This is different as compared to the static variational principle where reference systems 
with U' U may be considered. Note that the reference system has different micro- 
scopic parameters but is assumed to be in the same macroscopic state, i.e. at the same 
temperature T and the same chemical potential /i. By a proper choice of its one-particle 
part, the problem posed by the reference system H' can be much simpler than the orig- 
inal problem posed by H. We assume that the self-energy of the reference system E^/ y 
can be computed exactly, e.g. by some numerical technique. 

Combining Eqs. (121) and (125), one can eliminate the unknown functional i^t/[E]: 

4,f;[E] =a,,,^;[E]+Trln— 1— -Trln— j^-— . (126) 

It appears that this amounts to a shift of the problem only as the self-energy functional 
of the reference system again contains the full complexity of the problem. In fact, except 
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6^[i:(t')]=o 




FIGURE 6. (taken from Ref. [28]). Schematic illustration for the construction of consistent approx- 
imations within the SFT. The grand potential is considered as a functional of the self-energy which is 
parametrized by the one-particle parameters t of the Hamiltonian (U is fixed). At the physical self-energy, 
Q. is stationary (filled red circles). The functional dependence of H on E is not accessible on the entire 
space of self-energies (E space) but on a restricted subspace of "trial" self-energies parametrized by a 
subset of one-particle parameters (solid red lines) corresponding to a "reference system", i.e. a manifold 
of systems with the same interaction part but a one-particle part given by t' which renders the solution 
possible. The self-energy functional can be evaluated exactly on the submanifold of reference systems. 
A stationary point on this submanifold represents the approximate self-energy and the corresponding 
approximate grand potential (open circle). 

for the trivial case U = 0, the functional dependence of flj' is unknown - even if 
the reference system is assumed to be solvable, i.e. if the self-energy Ef'.j/, the Green's 
function Gt\u and the grand potential Q.ti,u of the reference system are available. 
However, inserting the self-energy of the reference system ^t'.u ii^to the self-energy 

functional of the original one, and using Q.t',u['^t',u] = ^t',u ^iid the Dyson equation of 
the reference system, we find: 

n,^u%',u]=^t',u+T^r\n \ TrlnG,..t/. (127) 

This is a remarkable result. It shows that an exact evaluation of the self-energy functional 
of a non-trivial original system is possible, at least for certain self-energies. This requires 
to solve a reference system with the same interaction part. 

Eq. (127) provides an explicit expression of the self-energy functional Q.t ij [Z] . This is 
suitable to discuss the domain of the functional precisely. Take U to be fixed. We define 
the space of ^-representable self-energies as 

^i/ = {Z|3r: Z = E,,i;}. (128) 
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This definition of the domain is very convenient since it ensures the correct analytical 
and causal properties of the variable Z. 

We can now formulate the above result in the following way. Consider a set of 
reference systems with U fixed but different one-particle parameters t' , i.e. a space of 
one-particle parameters .!^' . Assume that the reference system with H' = Hf/ y can be 

solved exactly for any G Then, the self-energy functional Q-t u [Z] can be evaluated 
exactly on the subspace 

S^{j = {'L\3t' e^' : 'L = 'Lf^u}cS^u- (129) 
This fact can be used to construct type-III approximations, see Fig. 6. 



6.6. Construction of cluster approximations 

A certain approximation is defined by a choice of the reference system or actually 
by a manifold of reference systems specified by a manifold of one-particle parameters 
As an example consider Fig. 7. The original system is given by the one-dimensional 
Hubbard model with nearest-neighbor hopping t and Hubbard interaction U. A possible 
reference system is given by switching off the hopping between clusters consisting of 
Lc = 2 sites each. The hopping within the cluster t' is arbitrary, this defines the space 
The self-energies in S^', the corresponding Green's functions and grand potentials 
of the reference system can obviously be calculated easily since the degrees of freedom 
are decoupled spatially. Inserting these quantities in Eq. (127) yields the SFT grand 
potential as a function of t': 

This is no longer a functional but an ordinary function of the variational parameters 
t' E The final task then consists in finding a stationary point t'^^^ of this function: 

_^ = foTt' = t',^,. (131) 

In the example considered this is a function of a single variable t' (we assume t' to be 
the same for all clusters). Note that not only the reference system (in the example the 




FIGURE 7. (taken from Ref. [28]). Variational cluster approximation (VCA) for the Hubbard model. 
Top: representation of the original one-dimensional Hubbard model H with nearest-neighbor hopping t 
and Hubbard-interaction U. Bottom: reference system H' consisting of decoupled clusters of Lc = 2 sites 
each with intra-cluster hopping f ' as a variational parameter. 
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FIGURE 8. (taken from Ref. [50]). SFT ground-state energy per site, i.e. {Q.{t') + n{N))/L, as a 
function of the intra-cluster nearest-neighbor hopping f' for Lc = 10 and different U {jJ. = U /2) at zero 
temperature. Arrows indicate the respective optimal f'. The energy scale is fixed by f = 1. 



isolated cluster) defines the final result but also the lattice structure and the one-particle 
parameters of the original system. These enter ^{t') via the free Green's function G/,o of 
the original system. In the first term on the r.h.s of Eq. (127) we just recognize the CPT 
Green's function 1 / {G^q — ^t',u)- The approximation generated by a reference system 
of disconnected clusters is called variational cluster approximation (VGA). 

An example for the results of a numerical calculation is given in Fig. 8, see also 
Ref. [50]. The calculation has been performed for the one-dimensional particle-hole 
symmetric Hubbard model at half-filling and zero temperature. The figure shows the 
numerical results for the optimal nearest-neighbor intra-cluster hopping t' as obtained 
from the VGA for a reference system with disconnected clusters consisting of Lc = 10 
sites each. The hopping t' is assumed to be the same for all pairs of nearest neighbors. In 
principle, one could vary all one-particle parameters that do not lead to a coupling of the 
clusters to get the optimal result. In most cases, however, is it necessary to restrict oneself 
to a small number of physically motivated variational parameters to avoid complications 
arising from a search for a stationary point in a high-dimensional parameter space. For 
the example discussed here, the parameter space is one-dimensional only. This is 
the most simple choice but more elaborate approximations can be generated easily. The 
flexibility to construct approximations of different quality and complexity must be seen 
as one of the advantages of the variational cluster approximation and of the SFT in 
general. 

As can be seen from the figure, a non-trivial result, namely t'^^i 7^ t = 1, is found as 
the optimal value of t'^^^^. We also notice that ?opt > f- The physical interpretation is that 
switching off the mfer-cluster hopping, which generates the approximate self-energy. 



Static and dynamic variational principles for strongly correlated electron systems February 23, 2012 33 




can partially be compensated for by enhancing the m?ra-cluster hopping. The effect is 
the more pronounced the smaller is the cluster size L^. Furthermore, it is reasonable that 
in case of a stronger interaction and thus more localized electrons, switching off the 
inter-cluster hopping is less significant. This can be seen in Fig. 8: The largest optimal 
hopping fppj is obtained for the smallest U . 

On the other hand, even a "strong" approximation for the self-energy (measured as a 
strong deviation of ^^pt from t) becomes irrelevant in the weak-coupling limit because 
the self-energy must vanish for U = 0. Generally, we note that the VGA becomes exact 
in the limit f/ = 0: In Eq. (126) the first and the third terms on r.h.s cancel each other 
and we are left with 

4,t/=oP]=Trln— 1— . (132) 

Since the trial self-energy has to be taken from a reference system with the same 
interaction part, i.e. J7 = 0, we have Z = 0, and the limit becomes trivial. For weak 
but finite U , the SFT grand potential becomes flatter and flatter until for f/ = the t' 
dependence is completely irrelevant. 

The VGA is also exact in the atomic limit or, more general but again trivial, in the 
case that there is no restriction on the trial self-energies: S^' = .Y. In this case, ^opt = t 
solves the problem, i.e. the second and the third term on the r.h.s of Eq. (127) cancel 

each other and Q.t,u[^t'.u] = ^t'.u for t' = t. 

Gluster-perturbation theory (GPT) can be understood as being identical with the VGA 
provided that the SFT expression for the grand potential is used and that no parameter 
optimization at all is performed. As can be seen from Fig. 8, there is a gain in binding 
energy due to the optimization of t', i.e. Cl{t'i) < Cl{t). This means that the VGA 
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sites 



FIGURE 10. (taken from Ref. [50]). Optimized hopping parameters for the reference systems shown in 
Fig. 7 but for larger clusters with Lc sites each as indicated. VCA results for [/ = 4, f = 1 at half-filling 
and temperature T = 0. Black: hopping assumed to be uniform. Red: two hopping parameters varied 
independently, the hopping at the two cluster edges and the "bulk" hopping. Blue: hopping at the edges, 
next to the edges and bulk hopping varied. Green: four hopping parameters varied. Orange: five hopping 
parameters varied. 



improves on the CPT result. 

Fig. 9 shows the ground-state energy (per site), i.e. the SFT grand potential at zero 
temperature constantly shifted by iiN at the stationary point, as a function of the inverse 
cluster size l/Lc. The dependence turns out to be quite regular and allows to recover 
the exact Bethe-Ansatz result (BA) [51] by extrapolation to l/Lc = 0. The VCA result 
represents a considerable improvement as compared to the "direct" cluster approach 
where Eq is simply approximated by the ground-state energy of an isolated Hubbard 
chain (with open boundary conditions). Convergence to the exact result is clearly faster 
within the VCA. Note that the direct cluster approach, opposed to the VCA, is not exact 
for U = 0. 

In the example discussed so far a single variational parameter was taken into account 
only. More parameters can be useful for different reasons. For example, the optimal 
self-energy provided by the VCA as a real-space cluster technique artificially breaks the 
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translational symmetry of the original lattice problem. Finite-size effects are expected 
to be the most pronounced at the cluster boundary. This suggests to consider all intra- 
cluster hopping parameters as independent variational parameters or at least the hopping 
at the edges of the chain. 

The result is shown in Fig. 10. We find that the optimal hopping varies between 
different nearest neighbors within a range of less than 10%. At the chain edges the 
optimal hopping is enhanced to compensate the loss of itinerancy due to the switched-off 
inter-cluster hopping within the VGA. With increasing distance to the edges, the hopping 
quickly decreases. Quite generally, the third hopping parameter is already close to the 
physical hopping t. Looking at the Lc = 10 results where all (five) different hopping 
parameters have been varied independently (orange circles), one can see the hopping to 
slightly oscillate around the bulk value reminiscent of surface Friedel oscillations. 

The optimal SFT grand potential is found to be lower for the inhomogeneous cases as 
compared to the homogeneous (black) one. Generally, the more variational parameters 
are taken into account the higher is the decrease of the SFT grand potential at optimal pa- 
rameters. However, the binding-energy gain due to inhomogeneous hopping parameters 
is much smaller compared to the gain obtained with a larger cluster. 

Gonsidering an additional hopping parameter ?pbc linking the two chain edges as a 
variational parameter, i.e. clusters with periodic boundary conditions always gives a 
minimal SFT grand potential at ?pbc = (instead of a stationary point at ?pbc = 1). This 
implies that open boundary conditions are preferred (see also Ref. [46]). 



7. CONSISTENCY, SYMMETRY AND SYSTEMATICS 

7.1. Analytical structure of the Green's function 

Gonstructing approximations within the framework of a dynamical variational princi- 
ple means that, besides an approximate thermodynamical potential, approximate expres- 
sions for the self-energy and the one-particle Greens function are obtained. This raises 
the question whether their correct analytical structure is respected in an approximation. 
For approximations obtained from self-energy-functional theory this is easily shown to 
be the case in fact. 

The physical self-energy E(^^((u) and the physical Green's function Gq^^{(o) are 
analytical functions in the entire complex ft) plane except for the real axis and have a 
spectral representation (see Eq. (68)) with non-negative diagonal elements of the spectral 
function. 

This trivially holds for the SFT self-energy ^t'.u.api^) since by construction 
'^t',u.ali{(^) is exact self-energy of a reference system. The SFT Green's function is 
obtained from the SFT self -energy and the free Green's function of the original model 
via Dyson's equation: 

G= .\ (133) 

It is easy to see that it is analytical in the complex plane except for the real axis. To 
verify that is has a spectral representation with non-negative spectral function, we can 
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equivalently consider the corresponding retarded quantity Gret(ft)) = G(aJ + zO+) for 
real frequencies co and verify that Gret = Gr — iGi with Gr, Gj Hermitian and Gj non- 
negative, i.e. JC^GjJC > for all jc''' = (. . . ,Xa, ■■■)■ 

This is shown as follows: We can assume that Go,ret = Go,R — iGqj with Go,r, 
Gq.i Hermitian and Go,i non-negative. Since for Hermitian matrices A, B with B non- 
negative, one has 1 / (A ± iB) =X^iY with X, Y Hermitian and Y non-negative (see Ref . 
[45]), we find GqI^^ = P^ + iPi with Pr, Pi Hermitian andPj non-negative. Furthermore, 
we have Ei-et = Er — iXi with Er, Hermitian and Ei non-negative. Therefore, 

" Pr + iPi - Er + iLi " Qr + /j2i ^^^"^^ 

with Qr Hermitian and Hermitian and non-negative. 

Note that the proof given here is simpler than corresponding proofs for cluster ex- 
tensions of the DMFT [52, 53] because the SFT does not involve a "self-consistency 
condition" which is the main object of concern for possible causality violations. 



7.2. Thermodynamical consistency 

An advantageous feature of the VGA and of other approximations within the SFT 
framework is their internal thermodynamical consistency. This is due to the fact that 
all quantities of interest are derived from an approximate but explicit expression for a 
thermodynamical potential. In principle the expectation value of any observable should 
be calculated by via 

(A) = ^ , (135) 

where Q. = Q.{t') is the SFT grand potential (see Eq. (130)) at t' = ^^pj and is a 
parameter in the Hamiltonian of the original system which couples linearly to A, i.e. 
Ht,u = Ho + XaA. This ensures, for example, that the Maxwell relations 

M = M (136) 

are respected. 

Furthermore, thermodynamical consistency means that expectation values of arbitrary 
one-particle operators A = La/3 Ag^^c^c^ can consistently either be calculated by a 
corresponding partial derivative of the grand potential on the one hand, or by integration 
of the one-particle spectral function on the other. As an example we consider the total 
particle number A'^ = La f ^Ca. A priori it not guaranteed that in an approximate theory 
the expressions 

W = -^, (137) 



and 



f{z)Aaa{z)dz (138) 
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with f{z) = 1/ (exp(/3z) + 1) and the spectral function A(^p (z) will give the same result. 

To prove thermodynamic consistency, we start from Eq. (137). According to Eq. 
(127), there is a twofold /i dependence ofQ. = Clt.u [^t'^^^p] ■ & the explicit /i dependence 

due to the chemical potential in the free Green's function of the original model, GJq — 
CO + n — t, and (ii) an implicit /i dependence due to the jU dependence of the self-energy 
T.fi u, the Green's function G,/ jj and the grand potential Q.fi u of the reference 
system: 

dD. do. 

{N) = -^ ^ . (139) 

Note that the implicit /i dependence is due to the chemical potential of the reference 
system which, by construction, is in the same macroscopic state as the original system 
as well as due to the /i dependence of the stationary point t'^^^ itself. The latter, however, 
can be ignored since 

do. dt' 

— ■ — =0 (140) 
dt' dfl 

for t' = fgpj because of stationarity condition Eq. (131). 

We assume that an overall shift of the one-particle energies e' = t'^^ is included in the 
set J^' of variational parameters. Apart from the sign this is essentially the "chemical 
potential" in the reference system but should be formally distinguished from jl since 
the latter has a macroscopic thermodynamical meaning and is the same as the chemical 
potential of the original system which should not be seen as a variational parameter. 

The self-energy and the Green's function of the reference system are defined as grand- 
canonical averages. Hence, their /i dependence due to the grand-canonical Hamiltonian 
= H' — jiN is (apart from the sign) the same as their dependence on e': Conse- 
quently, we have: 

da dQ ^ 
= -^=0 (141) 

due to the stationarity condition again. 

We are then left with the explicit /i dependence only: 

^ 1 1 

-Trln i = -Tr . (142) 



Converting the sum over the Matsubara frequencies implicit in the trace Tr into a contour 
integral in the complex ft) plane and using Cauchy's theorem, we can proceed to an 
integration over real frequencies. Inserting into Eq. (139), this yields: 



d(0 (143) 

co+iO+ 



for t' — fgpj which is just the average particle number given by (138). This completes the 
proof. 
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7.3. Symmetry breaking 



The above discussion has shown that besides intra-cluster hopping parameters it can 
also be important to treat one-particle energies in the reference cluster as variational 
parameters. In particular, one may consider variational parameters which lead to a lower 
symmetry of the Hamiltonian. 

As an example consider the Hubbard model on a bipartite lattice as the system of 
interest and disconnected clusters of size as a reference system. The reference-system 
Hamiltonian shall include, e.g., an additional a staggered magnetic-field term: 

^fict. = 5T^'K-'^/;)' (144) 

where z,- = -1-1 for sites on sublattice 1, and z, = — 1 for sublattice 2. The additional term 
leads to a valid reference system as there is no change of the interaction part. We 
include the field strength B' in the set of variational parameters, B' E 

B' is the strength of a fictitious field or, in the language of mean-field theory, the 
strength of the internal magnetic field or the Weiss field. This has to be distinguished 
clearly from an external physical field applied to the system with field strength B: 

Hphys. = 5^zK«it - (145) 

ia 

This term adds to the Hamiltonian of the original system. 

We expect B'^^^ = in case of the paramagnetic state and 5 = (and this is easily 
verified numerically). Consider the B' and B dependence of the SFT grand potential 
Q.{B',B) = t2g[Zg/]. Here we have suppressed the dependencies on other variational 
parameters t' and on t,U. Due to the stationarity condition, dQ.{B' ,B) / dB' = 0, the 
optimal Weiss field B' can be considered as a function of B, i.e. B'^^i = B\B) . Therefore, 
we also have: 

d da{B\B),B) 



dB dB' 



= 0. (146) 



This yields: 



d^a{B'{B),B) dB'jB) ^ d^a{B'{B),B) ^^^^^ 



dB'^ dB dBdB 

Solving for dB' /dB we find: 



dB' 

Ib 



dB 



.a 



dBdB' 



(148) 



This clearly shows that B' = B'^^^^ has to be interpreted carefully. B' can be much stronger 

than B if the curvature d^Q./dB'^ of the SFT functional at the stationary point is small, 
i.e. if the functional is rather flat as it is the case in the limit U ^0, for example. 
From the SFT approximation for the staggered magnetization, 

m = Iz,-((n,t-'^.i)) - j^^{B'{B\B) = ^^(B'm^B) ^^^^^ 



Static and dynamic variational principles for strongly correlated electron systems February 23, 2012 39 



where the stationarity condition has been used once more, we can calculate the suscep- 
tibility, 

dm _ dMB'{B),B)dB'{B) d^a{B'{B),B) 
^ dB dB'dB dB dB^ ■ ^ ^ 



Using Eq. (148), 



'^^-[d^^ [d^B^ ■ ''''' 



We see that there are two contributions. The first term is due to the explicit B dependence 
in the SFT grand potential while the second is due to the implicit B dependence via the 
B dependence of the stationary point. 

Eq. ( 1 5 1 ) also demonstrates (see Ref . [54]) that for the calculation of the paramagnetic 
susceptibility x one may first consider spin-independent variational parameters only 
to find a stationary point. This strongly reduces the computational effort [55]. Once a 
stationary point is found, partial derivatives according to Eq. (151) have to calculated 
with spin-dependent parameters in a single final step. 



7.4. Spontaneous symmetry breaking 

Spontaneous symmetry breaking is obtained at 5 = if there is a stationary point 
with 5qpj 7^ 0. Fig. 1 1 gives an example for the particle-hole symmetric two-dimensional 
Hubbard model on the square lattice at half-filling and zero temperature. As a reference 
system a cluster with Lc = 10 sites is considered, and the fictitious staggered magnetic 
field is taken as the only variational parameter. There is a stationary point at B' = 



-4.45 1' ' I ' ' ' ' I ' ' ' ' I ' ' ' ' I ' ' ' ' I ' ' ' ' I ' ' ' ' I ' ' 




B' 



FIGURE 11. (taken from Ref. [56]). SFT grand potential as a function of the strength of a fictitious 
staggered magnetic field B'. VCA calculation using disconnected clusters consisting of Lc = 10 sites each 
for the two-dimensional Hubbard model on the square lattice at half-filling, zero temperature, t/ = 8 and 
nearest-neighbor hopping f = 1 . 
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FIGURE 12. (taken from Ref. [56]). Comparison of the staggered magnetization m as a function of 
U at half-filling obtained by different methods: variational cluster approximation (VCA), variational 
Monte-Carlo (VMC) [57] and quantum Monte-Carlo (QMC) [58], see text. The arrow indicates the result 
m — Q.62± 0.04 of a Green's-function Monte-Carlo study [59] for the two-dimensional Heisenberg model. 



which corresponds to the paramagnetic phase. At 5' = the usual cluster-perturbation 
theory is recovered. The two equivalent stationary points at finite B' correspond to a 
phase with spontaneous antiferromagnetic order. As can be seen from the figure, the 
antiferromagnetic ground state is stable as compared to the paramagnetic phase. 

Its order parameter, i.e. the staggered magnetization m, is the conjugate variable to 
the physical field. Since the corresponding fictitious field is a variational parameter, m 
can either be calculated by integration of the spin-dependent spectral density or as the 
derivative of the SFT grand potential with respect to the physical field strength B with 
the same result. 

Fig. 12 shows the U dependence of m as calculated at the respective stable stationary 
point. As expected for the parameter regime considered here, a magnetically ordered 
ground state is obtained for any U > 0. The order parameter monotonously increases 
with increasing U and reaches a value close to m = 1 in the t/ — )■ oo limit. The trend 
agrees well with a variational iVlonte-Carlo calculation [57] which is a static mean- 
field approach based on the Ritz principle but includes correlation effects beyond the 
Hartree-Fock approximation by partially projecting out doubly occupied sites from a 
symmetry-broken trial wave function. It disagrees, however, with the result of auxiliary- 
field quantum IVIonte-Carlo calculations [58] for finite lattices. Within the QIVIC, the 
order parameter is obtained from simulations of the static spin-spin correlation function 
at low temperatures. It is assumed that the system effectively behaves as if at T = when 
the thermal correlation length exceeds the cluster dimensions. The VIVIC and QIVIC data 
displayed in Fig. 12 are extrapolated to infinite lattice size. 

In the U o° limit where the low-energy sector of the Hubbard model at half-filling 
can be mapped onto an antiferromagnetic Heisenberg model, the order parameter as 
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obtained within the VGA is overestimated. Physically, one would expect a reduction of 
the staggered magnetization from its saturated value m = 1 due to non-local transverse 
spin-spin correlations [60] as it is shown, for example, by a Greens's-function Monte- 
Garlo study [59] for the two-dimensional Heisenberg model (see arrow in Fig. 12). 

The VGA exactly takes into account electron correlations on a length scale given 
by the linear size of the reference cluster. The example shows, however, that longer- 
ranged correlations are neglected and that the approximation is like a static mean-field 
approximation beyond the cluster extension. It should be regarded as a cluster mean-field 
approach which is applicable if the physical properties are dominated by short-range 
correlations on a scale accessible by an exactly solvable finite cluster. 

The possibility to study spontaneous symmetry breaking using the VGA with suit- 
ably chosen Weiss fields as variational parameters has been exploited frequently in the 
past. Besides antiferromagnetism [56, 61, 62, 63], spiral phases [64], ferromagnetism 
[55], (i-wave superconductivity [65, 66, 67, 68, 69, 70, 71], charge order [72, 73] and 
orbital order [74] have been investigated. The fact that an explicit expression for a ther- 
modynamical potential is available allows to study discontinuous transitions and phase 
separation as well. 



7.5. Non-perturbative conserving approximations 

Gontinuous symmetries of a Hamiltonian imply the existence of conserved quanti- 
ties: The conservation of total energy, momentum, angular momentum, spin and particle 
number is enforced by a not explicitly time-dependent Hamiltonian which is spatially 
homogeneous and isotropic and invariant under global SU(2) and U(l) gauge transfor- 
mations. Approximations may artificially break symmetries and thus lead to unphysi- 
cal violations of conservations laws. Baym and Kadanoff [10, 11] have analyzed under 
which circumstances an approximation respects the mentioned macroscopic conserva- 
tion laws. Within diagrammatic perturbation theory it could be shown that approxima- 
tions which derive from an explicit but approximate expression for the Luttinger-Ward 
functional ^> ("4>-derivable approximations") are "conserving". Examples for conserv- 
ing approximations are the Hartree-Fock or the fluctuation-exchange approximation 
[10, 75]. 

The SFT provides a framework to construct ^>-derivable approximations for correlated 
lattice models which are non-perturbative, i.e. do not employ truncations of the skeleton- 
diagram expansion. Like in weak-coupling conserving approximations, approximations 
within the SFT are derived from the Luttinger-Ward functional, or its Legendre trans- 
form Fu [Z] . These are ^>-derivable since any type-III approximation can also be seen as 
a type-II one, see Section 4.3. 

For fermionic lattice models, conservation of energy, particle number and spin have 
to be considered. Besides the static thermodynamics, the SFT concentrates on the one- 
particle excitations. For the approximate one-particle Green's function, however, it is 
actually simple to prove directly that the above conservation laws are respected. A short 
discussion is given in Ref. [76]. 

At zero temperature T = there is another non-trivial theorem which is satisfied by 
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any ^»-derivable approximation, namely Luttinger's sum rule [9, 77]. This states that at 
zero temperature the volume in reciprocal space that is enclosed by the Fermi surface 
is equal to the average particle number. The original proof of the sum rule by Luttinger 
and Ward [9] is based on the skeleton-diagram expansion of 4> in the exact theory and 
is straightforwardly transferred to the case of a 4>-derivable approximation. This also 
implies that other Fermi-liquid properties, such as the linear trend of the specific heat at 
low T and Fermi-liquid expressions for the T = charge and the spin susceptibility are 
respected by a 4>-derivable approximation. 

For approximations constructed within the SFT, a different proof has to be found. One 
can start with Eq. (127) and perform the zero-temperature limit for an original system 
(and thus for a reference system) of finite size L. The different terms in the SFT grand 
potential then consist of finite sums. The calculation proceeds by taking the jU-derivative, 
for r = 0, on both sides of Eq. (127). This yields the following result (see Ref. [76] for 
details): 

{N) - W + 2£0(G,(O)) -2£0(G;t(O)) ■ (152) 
k k 

Here (A'^) {{N)') is the ground-state expectation value of the total particle number A'^ 
in the original (reference) system, and G^(0) (G|^(0)) are the diagonal elements of the 
one-electron Green's function G at ft) = 0. As Luttinger's sum rule reads 

(A^)=2£0(G;t(O)), (153) 
k 

this implies that, within an approximation constructed within the SFT, the sum rule is 
satisfied if and only if it is satisfied for the reference system, i.e. if (A^)' = 2 J^jt0(G^(O)). 
This demonstrates that the theorem is "propagated" to the original system irrespective of 
the approximation that is constructed within the SFT. This propagation also works in the 
opposite direction. Namely, a possible violation of the exact sum rule for the reference 
system would imply a violation of the sum rule, expressed in terms of approximate 
quantities, for the original system. 

There are no problems to take the thermodynamic limit L — )• oo (if desired) on both 
sides of Eq. (152). The k sums turn into integrals over the unit cell of the reciprocal 
lattice. For a D-dimensional lattice the D— 1 -dimensional manifold of k points with 
Gk{0) = oo or G;t(0) = form Fermi or Luttinger surfaces, respectively. Translational 
symmetry of the original as well as the reference system may be assumed but is not 
necessary. In the absence of translational symmetry, however, one has to re-interpret 
the wave vector k as an index which refers to the elements of the diagonalized Green's 
function matrix G. The exact sum rule generalizes accordingly but can no longer be 
expressed in terms of a Fermi surface since there is no reciprocal space. It is also valid 
for the case of a translationally symmetric original Hamiltonian where, due to the choice 
of a reference system with reduced translational symmetries, such as employed in the 
VGA, the symmetries of the (approximate) Green's function of the original system are 
(artificially) reduced. Since with Eq. (152) the proof of the sum rule is actually shifted to 
the proof of the sum rule for the reference system only, we are faced with the interesting 
problem of the validity of the sum rule for a finite cluster. For small Hubbard clusters 
with non-degenerate ground state this has been checked numerically with the surprising 
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result that violations of the sum rule appear in certain parameter regimes close to half- 
filling, see Ref. [76]. This leaves us with the question where the proof of the sum rule 
fails if applied to a system of finite size. This is an open problem that has been stated 
and discussed in Refs. [76, 78] and that is probably related to the breakdown of the sum 
rule for Mott insulators [79] . 



7.6. Systematics of approximations 

Since the SFT unifies different dynamical approximations within a single formal 
framework, the question arises how to judge on the relative quality of two different 
approximations resulting from two different reference systems. This, however, is not 
straightforward for several reasons. First, it is important to note that a stationary point 
of the self-energy functional is not necessarily a minimum but rather a saddle point 
in general (see Ref. [44] for an example). The self-energy functional is not convex. 
Actually, despite several recent efforts [80, 49, 81], there is no functional relationship 
between a thermodynamical potential and time-dependent correlation functions. Green's 
functions, self-energies, etc. which is known to be convex. 

Furthermore, there is no a priori reason why, for a given reference system, the SFT 
grand potential at a stationary point should be lower than the SFT grand potential at 
another one that results from a simpler reference system, e.g. a smaller cluster. This 
implies that the SFT does not provide upper bounds to the physical grand potential. 
There is e.g. no proof (but also no counterexample) that the DMFT ground-state energy 
at zero temperature must be higher than the exact one. On the other hand, in practical 
calculations the upper-bound property is usually found to be respected, as can be seen 
for the VGA in Fig. 9, for example. Nevertheless, the non-convexity must be seen as 
a disadvantage as compared to methods based on wave functions which via the Ritz 
variational principle are able to provide strict upper bounds. 

To discuss how to compare two approximations within SFT, we first have to distin- 
guish between "trivial" and "non-trivial" stationary points for a given reference system. 
A stationary point is referred to as "trivial" if the one-particle parameters are such that 
the reference system decouples into smaller subsystems. If, at a stationary point, all de- 
grees of freedom (sites) are still coupled to each other, the stationary point is called 
"non-trivial". It is possible to prove the following theorem [82]: Gonsider a reference 
system with a set of variational parameters t' = t" + V where V couples two separate 
subsystems. For example, V could be the inter-cluster hopping between two subclusters 
with completely decouples the degrees of freedom for V = and all t". Then, 

^t,u%"+v] = ^t,u[^t"+o] + ^{V^) , (154) 

provided that the functional is stationary at E,// when varying t" only (this restriction 
makes the theorem non-trivial). This means that going from a more simple reference 
system to a more complicated one with more degrees of freedom coupled, should 
generate a new non-trivial stationary point with V while the "old" stationary point 
with y = being still a stationary point with respect to the "new" reference system. 
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a) 



b) 



c) 



d) 




FIGURE 13. Possible trends of the SFT grand potential i2 as a function of a variational parameter Y 
coupling two subsystems of a reference system. V = corresponds to the decoupled case and must always 
represent a "trivial" stationary point. Circles show the stationary points to be considered. The point V = 
has to be disregarded in all cases except for a). The arrow marks the respective optimum stationary point 
according to the rules discussed in the text. 



Coupling more and more degrees of freedom introduces more and more stationary 
points, and none of the "old" ones is "lost". 

Consider a given reference system with a non-trivial stationary point and a number 
of trivial stationary points. An intuitive strategy to decide between two stationary points 
would be to always take the one with the lower grand potential Q-tn^t' Xl\ - ^ sequence 
of reference systems (e.g. //g, ...) in which more and more degrees of freedom 
are coupled and which eventually recovers the original system H itself, shall be called 
a "systematic" sequence of reference systems. For such a systematic sequence, the 
suggested strategy trivially produces a series of stationary points with monotonously 
decreasing grand potential. Unfortunately, however, the strategy is useless because it 
cannot ensure that a systematic sequence of reference systems generates a systematic 
sequence of approximations as well, i.e. one cannot ensure that the respective lowest 
grand potential in a systematic sequence of reference systems converges to the exact 
grand potential. Namely, the stationary point with the lowest SFT grand potential could 
be a trivial stationary point (like one associated with a very simple reference system 
only as //^ or //g in Fig. 14, for example). Such an approximation must be considered 
as poor since the exact conditional equation for the self-energy is projected onto a very 
low-dimensional space only. 

Therefore, one has to construct a different strategy which necessarily approaches the 
exact solution when following up a systematic sequence of reference systems. Clearly, 
this can only be achieved if the following rule is obeyed: A non-trivial stationary point 
is always preferred as compared to a trivial one (RO). A non-trivial stationary point 
at a certain level of approximation, i.e. for a given reference system becomes a trivial 
stationary point on the next level, i.e. in the context of a "new" reference system that 
couples at least two different units of the "old" reference system. Hence, by construction, 
the rule RO implies that the exact result is approached for a systematic series of reference 
systems. 

Following the rule (RO), however, may lead to inconsistent thermodynamic interpreta- 
tions in case of a trivial stationary point with a lower grand potential as a non-trivial one. 
To avoid this, RO has to be replaced by: Trivial stationary points must be disregarded 
completely unless there is no non-trivial one (Rl). This automatically ensures that there 
is at least one stationary point for any reference system, i.e. at any approximation level. 
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To maintain a thermodynamically consistent picture in case that there are more than a 
single non-trivial stationary points, we finally postulate: Among two non-trivial station- 
ary points for the same reference system, the one with lower grand potential has to be 
preferred (R2). 

The rules are illustrated by Fig. 13. Note that the grand potential away from a sta- 
tionary point does not have a direct physical interpretation. Hence, there is no reason 
to interpret the solution corresponding to the maximum in Fig. 13, c) as "locally unsta- 
ble". The results of Ref . [45] (see Figs. 2 and 4 therein) nicely demonstrate that with the 
suggested strategy (Rl, R2) one can consistently describe continuous as well as discon- 
tinuous phase transitions. 

The rules Rl and R2 are unambiguously prescribed by the general demands for 
systematic improvement and for thermodynamic consistency. There is no acceptable 
alternative to this strategy. The strategy reduces to the standard strategy (always taking 
the solution with lowest grand potential) in case of the Ritz variational principle because 
here a non-trivial stationary point does always have a lower grand potential as compared 
to a trivial one. 

There are also some consequences of the strategy which might be considered as 
disadvantageous but must be tolerated: (i) For a sequence of stationary points that 
are determined by Rl and R2 from a systematic sequence of reference systems, the 
convergence to the corresponding SFT grand potentials is not necessarily monotonous, 
(ii) Given two different approximations specified by two different reference systems, it 
is not possible to decide which one should be regarded as superior unless both reference 
systems belong to the same systematic sequence of reference systems. In Fig. 14, one 
has H'j^ < //g < //^ < where "<" stands for "is inferior compared to". Furthermore, 
//g < //p and H'p^ < but there is no relation between and H^, for example. 

8. BATH SITES AND DYNAMICAL MEAN-FIELD THEORY 

8.1. Motivation and dynamical impurity approximation 

Within self-energy-functional theory, an approximation is specified by the choice 
of the reference system. The reference system must share the same interaction part 
with the original model and should be amenable to a (numerically) exact solution. 
These requirements restrict the number of conceivable approximations. So far we have 
considered a decoupling of degrees of freedom by partitioning a Hubbard-type lattice 
model into finite Hubbard clusters of Lc sites each, which results in the variational cluster 
approximation (VGA). 

Another element in constructing approximation is to add degrees of freedom. Since 
the interaction part has to be kept unchanged, the only possibility to do that consists in 
adding new uncorrelated sites (or "orbitals"), i.e. sites where the Hubbard-iy vanishes. 
These are called "bath sites". The coupling of bath sites to the correlated sites with 
finite U in the reference system via a one-particle term in the Hamiltonian is called 
"hybridization". 

Fig. 14 shows different possibilities. Reference system A yields a trial self-energy 
which is local Lijaio)) = 5,;E((o) and has the same pole structure as the self-energy of 
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FIGURE 14. Different possible reference systems with the same interaction as the single-band Hubbard 
model on a square lattice. Filled circles: correlated sites with U as in the Hubbard model. Open circles: 
uncorrected "bath" sites with U = Q. Lines: nearest-neighbor hopping. Big circles: continuous bath 
consisting of Lt, = °° bath sites. Reference systems //^, //g, H'q generate variational cluster approximations 
(VCA), //g yields dynamical impurity approximation (DIA), //p the DMFT, and an intermediate 
approximation (VCA with one additional bath site per cluster). 

the atomic limit of the Hubbard model. This results in a variant of the Hubbard-I approx- 
imation [19]. Reference systems B and C generate variational cluster approximations. In 
reference system D an additional bath site is added to the finite cluster. Reference sys- 
tem E generates a local self-energy again but, as compared to A, allows to treat more 
variational parameters, namely the on-site energies of the correlated and of the bath site 
and the hybridization between them. We call the resulting approximation a "dynamical 
impurity approximation" (DIA) with L/^ = 1. 

The DIA is a mean-field approximation since the self-energy is local which indicates 
that non-local two-particle correlations, e.g. spin-spin correlations, do not feed back to 
the one-particle Green function. It is, however, quite different from static mean-field 
(Hartree-Fock) theory since even on the L}, = 1 level it includes retardation effects that 
result from processes where the electron hops from the correlated to the bath site 
and back. This improves, as compared to the Hubbard-I approximation, the frequency 
dependence of the self-energy, i.e. the description of the temporal quantum fluctuations. 
The two-site (L^ = 1) DIA is the most simple approximation which is non-perturbative, 
conserving, thermodynamically consistent and which respects the Luttinger sum rule 
(see Ref. [76]). Besides the "atomic" physics that leads to the formation of the Hubbard 
bands, it also includes in the most simple form the possibility to form a local singlet, 
i.e. to screen the local magnetic moment on the correlated site by coupling to the 
local moment at the bath site. The correct Kondo scale is missed, of course. Since 
the two-site DIA is computationally extremely cheap, is has been employed frequently 
in the past, in particular to study the physics of the Mott metal-insulator transition 
[44, 45, 83, 84, 85, 86, 87,76]. 
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8.2. Relation to dynamical mean-field theory 



Starting from E and adding more and more bath sites to improve the description of 
temporal fluctuations, one ends up with reference system F where a continuum of bath 
sites (Lh = oo) is attached to each of the disconnected correlated sites. This generates the 
"optimal" DIA. 

To characterize this approximation, we consider the SFT grand potential given by Eq. 
(127) and analyze the stationarity condition Eq. (131): Calculating the derivative with 
respect to t', we get the general SFT Euler equation: 

^LL L. 1/- X V 7-^-^t'.u{io)n)\ ^, = 0. (155) 

For the physical self-energy lLt,u of the original system Htu, the equation was fulfilled 

since the bracket would be zero. Vice versa, since Gti,ij = Gu[T.t'^u], the physical self- 
energy of Hf u is determined by the condition that the bracket be zero. Hence, one can 
consider the SFT Euler equation to be obtained from the exact conditional equation for 
the "vector" E in the self-energy space yy through projection onto the hypersurface 
of t' representable trial self-energies by taking the scalar product with vectors 
.u.ap {iO)„)/dt! tangential to the hypersurface. 
Consider now the Hubbard model in particular and the trial self-energies generated by 
reference system F. Actually, F is a set of disconnected single-impurity Anderson models 
(SIAM). Assuming translational symmetry, these impurity models are identical replicas. 
The self-energy of the SIAM is non-zero on the correlated ("impurity") site only. Hence, 
the trial self-energies are local and site-independent, i.e. Zij(y{i(On) = 5ijL{iO)n), and thus 
Eq. (155) reads: 

^Ee Lw. ' ^5i^ = 0, (156) 



n la 



iia 



where the notation has been somewhat simplified. The Euler equation would be solved 
if one-particle parameters of the SIAM and therewith an impurity self-energy can be 
found that, when inserted into the Dyson equation of the Hubbard model, yields a 
Green's function, the local element of which is equal to the impurity Green's function 
G' . Namely, the bracket in Eq. (156), i.e. the local (diagonal) elements of the bracket in 
Eq. (155), vanishes. Since the "projector" dlliicj{ico„)/dt' is local, this is sufficient. 

This way of solving the Euler equation, however, is just the prescription to obtain 
the self-energy within dynamical mean-field theory (DMFT) [25], and setting the local 
elements of the bracket to zero is just the self-consistency equation of DMFT. We 
therefore see that reference system F generates the DMFT. It is remarkable that with 
the VCA and the DMFT quite different approximation can be obtained in one and the 
same theoretical framework. 

Note that for any finite L^, as e.g. with reference system E, it is impossible to satisfy 
the DMFT self-consistency equation exactly since the impurity Green's function G' has 
a finite number of poles while the lattice Green's function (1/Gq ^(/ft),,) — E(?'(y„))/;c7 has 
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FIGURE 15. Quasiparticle weight z as a function of U for the half-filled Hubbard model at zero 
temperature as obtained by different approximations with local self-energy. Here the lattice structure 
enters the theory via the non-interacting density of states Pq{x) only. Po(-x) is assumed to be semielliptic 
with band width W = 4. BR: Brinkman-Rice (Gutzwiller) result [88]. 2S-DMFT: Two-site dynamical 
mean-field theory [89]. DIA: Dynamical impurity approximation with «.v = 1+^/7 = 2,4 local degrees of 
freedom [45]. DIA, «, = Full DMFT, evaluated using the DMFT-ED scheme with «, = 8 [44]. NRG: 
Full DMFT, evaluated using the numerical renormalization-group approach [90]. 



branch cuts. Nevertheless, with the "help" of the projector, it is easily possible to find a 
stationary point ?opt of the self-energy functional and to satisfy Eq. (156). 

Conceptually, this is rather different from the DIVIFT exact-diagonalization (DIVIFT- 
ED) approach [91] which also solves a SIAIM with a finite Lj-, but which approximates the 
DIVIFT self-consistency condition. This means that within DIMFT-ED an additional ad 
hoc prescription to necessary which, opposed to the DIA, will violate thermodynamical 
consistency. However, an algorithmic implementation via a self-consistency cycle to 
solve the Euler equation is simpler within the DIVIFT-ED as compared to the DIA 
[84, 87]. It has therefore been suggested to guide improved approximations to the self- 
consistency condition within DIMFT-ED by the DIA [92]. 

The convergence of results obtained by the DIA to those of full DIVIFT with increasing 
number of bath sites is usually fast. With respect to the U -T phase diagram for IVIott 
transition in the single-band half-filled Hubbard model, Lj-, = 3 is sufficient to get almost 
quantitative agreement with DIMFT-QIMC results, for example [84]. 

This is also demonstrated with Fig. 15 which shows the quasiparticle weight. 



1-JE(« = 0)/J£0' ' ' 

as a function of U for the Hubbard model with semielliptic free density of states (band 
width = 4) at half-filling and zero temperature. The Ivlott transition is marked by a 
divergence of the effective mass or, equivalently, by z = as i7 approaches the critical 
value from below. It is remarkable that already the ris = I + Lj, = 4-site DIA but also 
the Hs = 2- site DIA (which requires the diagonalization of a dimer model only) yields 
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z{U) in an almost quantitative agreement with the full DMFT. The Us = 2-site-DIA 
may also be compared with the "linearized" or "two-site" DMFT [93, 89] where the 
Hubbard model is mapped onto the two-site single-impurity Anderson model by means 
a strongly simplified self-consistency condition. As compared to the two-site DMFT, 
the self-energy-functional approach not only represents a clear conceptual improvement 
but also improves the actual results for z{U) and for Uc (see Fig. 15 and Ref. [44]). 
Any arbitrariness in the method to find the parameters of the impurity model is avoided 
completely and consistent results are obtained for any finite n^. 

8.3. Real-space dynamical mean-field theory 

Real-space DMFT [94, 95, 96, 97, 98, 99] with a local but site-dependent self- 
energy Zija{iO}n) = dijLi{iO)n) is obtained from reference system F if the one-particle 
parameters of the single-impurity Anderson models at different sites are allowed to be 
different. This becomes relevant for spatially inhomogeneous systems like semi-infinite 
lattices where surface effects can be studied or thin films or multilayers which are 
confined in one spatial direction or systems of fermions enclosed in an inhomogeneous 
external potential like gases of ultacold fermions in a confining harmonic potential. 

Treating the different impurity models as inequivalent means to enlarge the space of 
trial self-energies. Fig. 16 shows the generalized self-consistency cycle of real-space 
DMFT. The (inhomogeneous) lattice is partitioned into classes of sites which are equiv- 
alent due to residual spatial symmetries like, for example, lateral translational symmetry 
in case of a thin-film geometry. A class of equivalent sites is denoted by a. Within a 
given class the local self-energy is assumed to be site-independent while self-energies 
from different classes are allowed to be different. For a completely disordered system 
without any spatial symmetry, the self-energy is fully site-dependent. It this case a 
is a site index and the theory becomes equivalent with the so-called statistical DMFT 



lattice model 




impurity models 

FIGURE 16. Generalized self-consistency cycle of the real-space dynamical mean-field theory, see text 
for discussion. 
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[100, 101, 102, 103]. 

As is sketched by Fig. 16, we start from a guess for the self-energy for each 

class of sites. With this and with the free Green's function of the inhomogeneous lattice 
model, we get the interacting lattice Green's function Gija[i(On) by solving Dyson's 
equation for the lattice model. Due to the reduced spatial symmetry, this is a matrix 
equation with a dimension given by the number of classes of inequivalent sites (X. 
Eq. (156) then provides us with one local DMFT self-consistency equation for each 
class which fixes the parameters of each of the inequivalent single-impurity Anderson 
models. The final and most important step in the cycle consists in solving the impurity 
models to get the self-energies Ea(?'(U„) for all a. The different impurity models can be 
solved independently from each other in each step of the self-consistency cycle while the 
coupling between the different sites or classes of sites is provided by the Dyson equation 
of the lattice model. 



8.4. Cluster mean-field approximations 

As a mean-field approach, the DIA does not include the feedback of non-local two- 
particle correlations on the one-particle spectrum and on the thermodynamics. The DIA 
self-energy is local and takes into account temporal correlations only. A straightforward 
idea to include short-range spatial correlations in addition, is to proceed to a reference 
system with Lc > 1, i.e. system of disconnected finite clusters with Lc sites each. The 
resulting approximation can be termed a cluster mean-field theory since despite the 
inclusion of short-range correlations, the approximation is still mean-field-like on length 
scales exceeding the cluster size. 

For an infinite number of bath degrees of freedom — )■ oo attached to each of the Lc > 
1 correlated sites the cellular DMFT [53, 104] is recovered, see Fig. 17. Considering 
a single-band Hubbard model again, this can be seen from the corresponding Euler 
equation: 

TYtl—. G'iicon)] ^%^ = 0, (158) 

where the prime at the sum over the sites indicates that i and j must belong to the 
same cluster of the reference system. Namely, llij(j{iC0n) = and also the "projector" 
d'Ljia{iO)n) /di' = if / and j belong to different clusters. This stationarity condition can 
be fulfilled if 

I — \ ^- G'{mn)\ =0. (159) 

Note that G'(zft)„) is a matrix which is labeled as G|y^/(z(0„) where z, j = l,...,Lc refer 
to the correlated sites in the cluster while k,l = l,...,Li,Lc to the bath sites. Gyy(z(0„) are 
the elements of the cluster Green's function on the correlated sites. The condition Eq. 
(159) is just the self-consistency condition of the C-DMFT. 
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FIGURE 17. Dynamical impurity and cluster approximations generated by different reference systems 
within SFT. Lc is the number of correlated sites in the reference cluster «, = 1 + is the number of 
local degrees of freedom where denotes the number of additional bath sites attached to each of the Lc 
correlated sites. The variational cluster approximation (VCA) is obtained for finite clusters with Lc > 1 
but without bath sites «s = 1 . This generalizes the Hubbard-I-type approximation obtained for Lc = I. The 
dynamical impurity approximation (DIA) is obtained for «s > 1 but for a single correlated site Lc = 1. 
A continuum of bath sites, «s = °° generates DMFT (Lc — 1) and cellular DMFT {Lc > 1). Lc — °°, 
irrespective of the number of local degrees of freedom, corresponds to the exact solution. 



As is illustrated in Fig. 17, the exact solution can be obtained with increasing cluster 
size Lc —7- oo either from a sequence of reference systems with a continuous bath Hs = 
1 + = oo, corresponding to C-DMFT, or from a sequence with = 1 , corresponding 
to VCA, or with a finite, small number of bath sites ("cellular DIA"). Systematic studies 
of the one-dimensional Hubbard model [46, 50] have shown that the energy gain which 
is obtained by attaching a bath site is lower than the gain obtained by increasing the 
cluster. This suggests that the convergence to the exact solution could be faster on the 
"VCA axis" in Fig. 17. For a definite answer, however, more systematic studies, also in 
higher dimensions, are needed. 

In any case bath sites help to get a smooth dependence of physical quantities when 
varying the electron density or the (physical) chemical potential. The reason is that 
bath sites also serve as "charge reservoirs", i.e. during a ji scan the ground state of 
the reference cluster may stay in one and the same sector characterized by the conserved 
total particle number in the cluster while the particle number on the correlated sites 
and the approximate particle number in the original lattice model evolve continuously 
[50, 55]. This is achieved by a /i-dependent charge transfer between correlated and bath 
sites. In addition, (at least) a single bath site per correlated site in a finite reference cluster 
is also advantageous to include the interplay between local (Kondo-type) and non- 
local (antiferromagnetic) singlet formation. This has been recognized to be important in 
studies of the Mott transition [105] in the two-dimensional and of ferromagnetic order 
in one-dimensional systems [55]. For studies of spontaneous U(l) symmetry breaking, 
e.g. J-wave superconductivity in the two-dimensional Hubbard model [65, 66, 67, 68], 
doping dependencies can be investigated without bath sites due to mixing of cluster 
states with different particle numbers. 

To benchmark the cluster mean-field approximations that can be constructed from 
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FIGURE 18. (taken from Ref. [50]) Local Green's function on the Matsubara frequencies, Giia{i(0„), 
for U = 6 using the Lc = 2, ris = 3 and the Lc = 4, «i = 3 reference systems (red solid lines). Zero- 
temperature results for the one-dimensional Hubbard model at half-filling with nearest-neighbor hopping 
set to f = 1 are compared to corresponding dynamical DMRG data (black solid line), Lc = 2 cellular 
DMFT at finite but low temperature (/3 = 20) and Lc = 2 cluster dual-fermion theory (cluster DF, j3 = 20) 
from from Ref. [106]. 



the dynamical variational principle, it is instructive to compare with numerically exact 
data. Essentially exact results from the dynamical density-matrix renormalization-group 
approach (DIVIRG) are available for the one-dimensional Hubbard model at zero tem- 
perature (see Ref. [106]). A one-dimensional model with strong spatial correlations is 
actually the least favorable limit to start off with a cluster approach. Fig. 18 shows the 
result for the local one-particle Green's function on the IVIatsubara frequencies as ob- 
tained from the variational cluster approximation with Lc = 2 correlated sites and with 2 
bath sites per correlated site in addition (ris = 3). While the overall trend is reproduced 
well, there are also apparent discrepancies at intermediate frequencies. The comparison 
with the cellular DIVIFT result using weak-coupling continuous-time QIVIC at finite tem- 
perature (1/r = /3 = 20), however, shows that there is hardly any improvement when 
a continuum of bath sites is attached (n^ = oo), i.e. the main effect is captured with a 
very small number of local degrees of freedom already. The cluster dual-fermion 
method [106] on the other hand represents an approach that goes beyond the cluster 
mean-field theory by including diagrams contributing to the inter-cluster elements of the 
self-energy. For Lc = 2 this is clearly superior as compared to the VGA for Lc = 2 (and 
/3 = 20) and already quite close to the DIVIRG data. It should be noted, however, that 
the VGA calculations are computationally much cheaper. Using a larger cluster, i.e. an 
Lc = 4 reference system with bath sites (n, = 3) attached to the cluster edges only, the 
VGA Green's function can hardly be distinguished from the dynamical DIVIRG result. 
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8.5. Translation symmetry 



For any cluster approximation formulated in real space there is an apparent problem: 
Due to the construction of the reference system as a set of decoupled clusters, the 
trial self-energies do not preserve the translational symmetries of the original lattice. 
Trivially, this also holds if periodic boundary conditions are imposed for the individual 
cluster. Transformation of the original problem to reciprocal space does not solve the 
problem either since this also means to transform a local Hubbard-type interaction into 
a non-local interaction part which basically couples all k points. 

There are different ideas to overcome this problem. We introduce a "periodizing" 
universal functional 

^P]'/ = 7E^'W,/'-/^''/ (160) 

which maps any trial self-energy onto translationally invariant one. In reciprocal space 
this corresponds to the substitution E^. /^i — )■ T[T.]i^ = 5^. j^/Ej^. Using this, we replace the 
self-energy functional of Eq. (125) by 

a;;),[£]^Trln ^ (161) 

as suggested in Ref. [107]. This new functional is different from the original one. 
However, as the physical self-energy is supposed to be translational invariant, it is a 
stationary point of both, the original and the modified functional. This means that the 
modified functional can likewise be used as a starting point to construct approximations. 
It turns out (see Ref. [103] for an analogous discussion in case of disorder) that for a 
reference system with > 1 and = oo, the corresponding Euler equation reduces to 
the self-consistency equation of the so-called periodized cellular DMFT (PC-DMFT) 
[108]. The same modified functional can also be used to construct a periodized VGA, 
for example. 

While the main idea to recover the PC-DMFT is to modify the form of the self-energy 
functional, the dynamical cluster approximation (DCA) [109, 110, 52] is obtained with 
the original functional but a modified hopping term in the Hamiltonian of the original 
system. We replace t ^ t and consider the functional fly^^p]. To ensure that the 
resulting approximations systematically approach the exact solution for cluster size 
Lc — )■ o°, the replacement t ^ t must be controlled by Lc, i.e. it must be exact up to 
irrelevant boundary terms in the infinite-cluster limit. This is the case for 

7= {yw)uhu{yw)^ , (162) 

where U, V, and W are unitary transformations of the one-particle basis. U is the Fourier 
transformation with respect to the original lattice consisting of L sites (Lx L unitary 
matrix). W is the Fourier transformation on the cluster (L^ x L^), and V the Fourier 
transformation with respect to the superlattice consisting of L/Lc supersites given by 
the clusters (L/Lc x L/Lc). The important point is that for any finite the combined 
transformation VW = WV ^ U, while this becomes irrelevant in the limit Lc — )■ °°. The 
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detailed calculation (see Ref. [103] for the analogous disorder case) shows that the DCA 
is recovered for a reference system with with > 1 and = oo, if periodic boundary 
conditions are imposed for the cluster. The same modified construction can also be 
used to a get simplified DCA-type approximation without bath sites, for example. This 
"simplified DCA" is related to the periodized VGA in the same way as the DCA is related 
to the PC-DMFT. The simplified DCA would represent a variational generalization of a 
non-self-consistent approximation ("periodic CPT") introduced recently [111]. 

9. CONCLUDING DISCUSSION 

The physical properties of strongly correlated electron systems are often dominated by a 
complex interplay of a macroscopically large number of degrees of freedom that cannot 
be described by means of brute-force numerical methods: The exact-diagonalization 
or Lanczos technique [112, 113] is restricted to systems with moderately large Hilbert 
space dimensions only, the numerical renormalization-group approach [114] as well 
as the density-matrix renormalization group [115, 116] are designed for impurity or 
one-dimensional models only, and the quantum Monte-Carlo technique [117, 118, 11 9] 
suffers from the famous sign problem for most of the two- or higher-dimensional lattice 
fermion models. 

A completely different approach to strongly correlated systems is offered by mean- 
field theory. Common to the huge number of different versions of mean-field concepts is 
that certain correlations between the degrees of freedom are neglected to make the prob- 
lem tractable analytically or accessible to one of the aforementioned numerical meth- 
ods. Hartree-Fock theory is one of the most clear and also one of the most frequently 
employed mean-field approaches. For an interacting A^^-electron system, the Hartree- 
Fock approximation decouples the dynamics of a particular electron from the rest of 
the system by setting up an effective one-particle Schrodinger equation which includes 
an effective one-electron potential which is self-consistently produced by the remaining 
A'' — 1 electrons. 

We have seen that a very elegant way to derive the Hartree-Fock theory for a system 
with a rather general second-quantized Hamiltonian is to employ a generalized version 
of the Ritz variational principle 6Q.t,u [p] = 0. To get an ansatz for the density operator p, 
it has turned out to be convenient in general to assume p to be given as the exact density 
operator p,/ jy/ of a certain reference system with different one-particle and interaction 
parameters t' and U' . Hartree-Fock theory is then obtained with U' = but t' arbitrary, 
i.e. the trial density operator is parameterized by t' and the grand potential is optimized 
by varying p,/ o through variation of t'. 

The essential point why this concept is working at all, consists in the fact that due 
to Wick's theorem the density-operator functional Q.t^u[p] can be evaluated exactly on 
the subspace of trial density operators while this is generally impossible in practice for 
an arbitrary p. This is the reason why other approaches based on the static variational 
principle, like the Gutzwiller approach [20, 120], for example, have to employ additional 
approximations or, like the variation of matrix-product states [116], need additional 
heavy numerical tools and are restricted to certain limits, as e.g. to one-dimensional 
models. 
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Loosely speaking, the dynamical variational principle states that the grand potential 
of an interacting electron system is stationary at the physical Green's function or the 
physical self-energy. It actually goes without saying that this crucially depends on the 
functional form of Q.t u [E] . In particular, to formulate a dynamical variational principle, 
it is inevitable to involve the Luttinger-Ward functional in the one or other way or 
similar functionals such as its Legendre transform. The Luttinger-Ward functional can 
either be defined order by order in diagrammatic perturbation theory or by means 
of the fermionic path integral. Likewise, the dynamical variational principle can be 
formulated in different equivalent variants, namely as stationarity of the grand potential 
as a functional of the Green's function, of the self-energy or as a functional of both. 
A slight conceptual advantage of the self-energy-functional approach is that dynamical 
mean-field theory can be obtained by merely restricting the domain of the functional, 
i.e. in the same way as Hartree-Fock theory is obtained from the static Ritz principle. 

The dynamical variational principle is in fact known since long [9] but has not been 
employed to construct non-perturbative approximations until recently [44] . In fact, a di- 
rect evaluation of the Luttinger-Ward functional for a given Green's function is a hope- 
less task since this implies the full summation of skeleton diagrams of arbitrary order. 
The essential point is that to make use of the principle one can proceed analogously to 
the construction of static mean-field theory from the Ritz principle and choose a ref- 
erence system that spans a certain space of trial self-energies such that the self-energy 
functional can be evaluated exactly there. This is in fact possible if reference systems 
are considered that share with the original system the same interaction part but are ac- 
cessible to a numerically exact solution. The underlying reason is that the form of the 
Luttinger-Ward functional is exactly the same in both cases as this parametrically de- 
pends on the vertices only and that therewith the functional can be evaluated for Green's 
functions of the reference system at least. 

For both, the static and the dynamical variational principle, type-Ill approximations, 
i.e. approximations that merely result from a restriction of the domain of the functional 
as specified by the choice of a reference system, can be constructed in a very similar way 
- at least formally. In practice, however, the resulting approximations are very different. 
For the static variational principle, the only meaningful reference system that still allows 
for an exact evaluation is given by £/' = but t' arbitrary. This implies that one is looking 
for the optimal non-interacting density operator for an interacting system and what is 
found is Hartree-Fock theory. For the dynamical variational principle, on the other hand, 
the only meaningful reference systems are those where U' = U and t' such that the 
reference system's Hamiltonian describes a set of spatially decoupled subsystems with 
small Hilbert-space dimension respectively. The prototypical example is the variational 
cluster approximation or, if local variational degrees of freedom are added, the (cellular) 
dynamical mean-field theory. Hence, while static theory produces non-local but also 
effectively non-interacting density operators, the dynamical approach leads to local but 
non-perturbative self-energies. 

A different way of stating this is to say that dynamical objects, the self-energy or the 
Green's function, are necessarily to be considered as the basic quantities of the theory if 
one aims at an embedding approach, i.e. at an approach where one likes to expand around 
a local or cluster model and where such an expansion is sensible as the dominating 
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correlations are local. Note that the same concept fails if one tries to base this on the 
static variational principle: Choosing with t' and U a reference system of decoupled 
sites or clusters with the same interaction part as the original (Hubbard-type) model, 
spans a space of density operators which factorize spatially. This implies, however, that 
any spatial correlation function vanishes completely. This would represent a too strong 
approximation that cannot be tolerated in any circumstance. 

It is interesting to note that only the dynamical variational principle allows for a 
systematic improvement of approximations, namely simply by enlarging the cluster 
size in the reference system. This ensures that, at least in principle, one can approach 
the exact solution. Opposed to this, there is no way of improving on the Hartree-Fock 
approximation without giving up the concept of type-III approximations. For any finite 
U' in the reference system. Wick's theorem is no longer applicable. 

Another quite important difference between the static and the dynamical approach 
shows up for systems with non-local interaction, i.e. for models beyond the simple 
Hubbard model. While nothing actually happens to the formal construction of static 
mean-field theory, a dynamical mean-field theory can no longer be formulated from the 
dynamical variational principle: There is simply no reference system at all that can be 
solved numerically exact: As we have seen, to circumvent the practically impossible di- 
rect evaluation of the Luttinger-Ward functional, U' = U must be assumed, and thereby 
the reference system's degrees of freedom remain coupled. An interesting possible way 
out which is worth to be explored in the future, is to set up a generalized dynamical vari- 
ational principle where the U -dependent but ^-independent Luttinger-Ward functional 
is replaced by a truly universal, i.e. t and fZ-independent functional. This would allow 
to consider reference systems with arbitrary parameters t' and U' and with U' ^ U in 
particular. 

Finally, it should be stressed that despite all the close analogies there is no known strict 
correspondence between the static and the dynamical variational principle and no corre- 
spondence between the approximations constructed within the different frameworks. For 
example, the density operator of the system's thermal state within dynamical mean-field 
theory cannot be specified explicitly. Within a dynamically constructed approximation, 
such as DMFT, the approximate quantum state of the original lattice system that cor- 
responds to the DMFT Green's function or self-energy is not known. Vice versa, static 
mean-field theory cannot be derived from the dynamical variational principle as a type- 
III approximation: Choosing a reference system with U' = implies to shrink the space 
of trial self-energies to a single point E = which is different from the Hartree-Fock 
self-energy. The missing one-to-one correspondence between static and dynamical ap- 
proaches also reflects itself in the fact that neither the self-energy functional nor any of 
the suggested variants are generally concave (or convex). Stationary points are usually 
saddle points instead of local or global extrema. Accordingly, dynamical approximations 
cannot be shown to provide strict upper bounds to the true grand potential of the system. 
This is opposed to any static approximation, including even the crudest ones. 
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